Investing During High Inflation Periods

Financial econometrics is the toolkit that turns market noise into structured insight. At its core, it asks a simple question: can we model and forecast asset returns in a way that is statistically sound and practically useful? The answer is yes—but only if you use the right model for the right job.

Think of these models as different tools in a mechanic's shop. A wrench is not better than a screwdriver. Each one is built for a different purpose. The skilled mechanic—and the skilled analyst—knows which tool to grab for which bolt.

The table below lays out the major model families, what each one is designed to do, and where they shine.

Table 1: Major Model Families for Return Modeling
Model Family	What It Tries to Do	Core Assumption	Best Used For
Asset Pricing (CAPM, Fama-French)	Explain why returns differ across assets	Risk factors drive expected returns	Portfolio construction, performance attribution
Time Series (ARIMA, GARCH)	Forecast returns using past patterns	Historical patterns repeat in some form	Short-term forecasting, volatility estimation
Multivariate (VAR, Cointegration)	Model how multiple assets move together	Assets share long-run relationships	Pairs trading, spillover analysis
Machine Learning (XGBoost, LSTM)	Learn complex, nonlinear patterns	Data contains hidden structures	Return prediction, factor discovery
Hybrid Models (ARIMA-LSTM, GARCH-XGBoost)	Combine linear and nonlinear strengths	No single model captures all patterns	Production forecasting where accuracy matters most

Financial returns are not normal. They have fat tails, meaning extreme events happen far more often than standard models predict. They show volatility clustering, where calm periods follow calm periods and turbulent ones follow turbulent ones. They also display the leverage effect, where bad news raises future volatility more than good news of the same size. Any model worth using must wrestle with these facts.

A growing body of research from 2025 and 2026 confirms that no single model dominates across all market conditions. ARIMA delivers reliable, low-cost forecasts for 1–5 day horizons when market dynamics stay linear. GARCH captures volatility clustering with precision. But when relationships become nonlinear, machine learning models generate competitive accuracy by learning patterns that traditional models miss. The gap between in-sample fit and out-of-sample performance remains the central tension in the entire field.

Key-Points

Know What You Are Solving Before You Choose a Model

Asset pricing models explain returns through risk factors. Time series models forecast using past patterns. Machine learning finds nonlinear relationships hidden in the data.

Your goal matters: asking "why returns differ" requires a different model than asking "what will happen tomorrow."

The Workhorses: ARIMA and the GARCH Family

ARIMA (Autoregressive Integrated Moving Average) models are the starting point for most return forecasting work. They follow the Box-Jenkins methodology: identify the model structure from data patterns, estimate the parameters, then run diagnostic checks on the residuals. A broad body of evidence shows that ARMA models deliver reliable, low-cost forecasts especially for 1- to 5-day horizons—and when paired with GARCH, they can track volatility bursts with notable precision.

But ARIMA has a blind spot. It assumes volatility stays constant. Real markets do not work that way. Volatility changes over time, and that is where the GARCH family (Generalized Autoregressive Conditional Heteroskedasticity) comes in. These models estimate the conditional variance directly, capturing the volatility clustering that defines real financial data.

The table below maps out the main GARCH variants and when each one earns its place in your toolbox.

Table 2: GARCH Family Models — A Quick Reference
Model	What Makes It Different	Best For	Key Limitation
GARCH(1,1)	Standard volatility modeling with symmetric response to shocks	General volatility forecasting	Cannot capture asymmetric responses to good vs. bad news
EGARCH	Exponential form; handles asymmetric news impact naturally	Markets where bad news drives bigger volatility spikes	More complex to estimate; may overfit with short data
GJR-GARCH	Adds a threshold term for negative shocks	Capturing the leverage effect in equity markets	Assumes a specific functional form for asymmetry
APARCH	Flexible power term; nests many other GARCH models	Markets requiring heavy-tailed error distributions	Many parameters; needs long data series for stable estimates
ARIMA-GARCH (Hybrid)	Models the mean with ARIMA and the variance with GARCH	Joint forecasting of returns and volatility	Still linear in the mean equation; can miss nonlinear patterns

A 2025 study on the S&P 500 index spanning 2023–2025 found that asymmetric models like GJR-GARCH achieved superior in-sample performance according to AIC and BIC criteria. However, the standard GARCH model delivered more consistent and accurate out-of-sample volatility forecasts—a finding that carries important implications for risk managers. Models with the lowest AIC values are not automatically the models that predict best.

A separate study on the JSE Top40 Index found that the ARMA(3,2)-EGARCH(1,1) specification with skewed Student's t errors performed best among competing GARCH variants. The hybrid ARMA(3,2)-EGARCH(1,1)-XGBoost model then captured residual nonlinearities that the standalone econometric specification left on the table, improving forecast accuracy across all standard measures.

A quantitative analyst at a Johannesburg fund noticed that his plain GARCH(1,1) model kept underestimating risk during market sell-offs. He switched to EGARCH with a skewed Student's t distribution. The new model immediately flagged higher tail risk ahead of a volatile week. His fund reduced exposure. When the market dropped 4% that Friday, his portfolio lost only half of what the benchmark did. The model upgrade paid for itself in one trading session.

Key-Points

GARCH Is Not One Model — It Is a Family

Plain GARCH works for symmetric volatility. EGARCH and GJR-GARCH handle asymmetry. APARCH adds flexibility. Choose based on whether your market shows a leverage effect.

Pairing ARIMA for the mean with GARCH for the variance is a proven formula. But standalone econometric models still leave nonlinear patterns unmodeled.

From CAPM to Factor Models: Explaining Why Returns Differ

While ARIMA and GARCH focus on forecasting, asset pricing models ask a different question: what drives the differences in average returns across stocks? The Capital Asset Pricing Model (CAPM) started it all with a single factor—market risk. But the CAPM has well-documented limitations. Its assumption of a single common investment horizon for all investors is a known conceptual problem that motivated the search for better frameworks.

Fama and French expanded the horizon dramatically. Their three-factor model added size (small stocks tend to beat large ones) and value (cheap stocks tend to beat expensive ones). The five-factor model went further, layering in profitability (RMW, or Robust Minus Weak) and investment (CMA, or Conservative Minus Aggressive). The table below traces this evolution step by step.

Table 3: The Evolution of Factor Models
Model	Factors Included	Explanatory Power (R² Range)	Main Weakness
CAPM	Market risk (single factor)	~70% for diversified portfolios	Fails to explain size and value effects; unrealistic assumptions
Fama-French 3-Factor	Market, Size (SMB), Value (HML)	~85-90% for diversified portfolios	Struggles with small growth stocks that invest heavily
Carhart 4-Factor	FF3 plus Momentum (WML)	Slightly higher than FF3	Momentum can crash badly during market regime shifts
Fama-French 5-Factor	Market, SMB, HML, Profitability (RMW), Investment (CMA)	71-94% across test portfolios	CMA and HML may be redundant in some markets; does not fix the small-growth problem fully

The Fama-French five-factor model explains between 71% and 94% of the cross-sectional variance in expected returns across size, value, profitability, and investment portfolios. But more factors do not always mean better results. Research from Robeco notes that adding CMA and RMW can make the value factor HML redundant in some market conditions. A daily value-weighted portfolio of the 20 highest-ranked stocks earned a Fama-French five-factor plus momentum alpha of 19.4 basis points and an annualized Sharpe ratio of 2.68 over April 2025 to March 2026, accumulating roughly 49% cumulative return versus 21.2% for the Russell 1000 benchmark.

A 2025 Bayesian framework published in the Journal of Econometrics quantifies model uncertainty directly. The researchers found that model uncertainty escalates during major market events and carries a significantly negative risk premium of approximately half the magnitude of the market premium itself. Positive shocks to model uncertainty predict persistent outflows from equity funds and inflows to Treasury funds. In plain terms: when investors do not know which model to trust, they sell stocks.

In early 2025, a portfolio manager noticed that Fama-French five-factor model R² values started dropping across his U.S. value portfolios. The model's explanatory power fell from 89% to below 75%. He dug into the numbers. The CMA factor had become redundant. By dropping CMA and keeping RMW, his adjusted model fit improved. The lesson: factor models are not set-and-forget tools. They need regular checkups.

Machine Learning and Deep Learning Enter the Arena

Traditional econometric models assume a specific functional form. You tell the model that returns depend linearly on a set of factors. Machine learning does not need that instruction. Given enough data, algorithms like XGBoost, Random Forest, and neural networks can discover patterns on their own.

A major 2025 study comparing ARIMA, GARCH, Random Forest, and XGBoost on S&P 500 daily prices found that ARIMA performs well under linear dynamics, GARCH captures volatility clustering accurately, and tree-based models provide competitive accuracy by learning nonlinear relationships. The key trade-off is real: interpretability and predictive power pull in opposite directions.

Deep learning pushes further. Research published in August 2025 tested 1D CNN and LSTM architectures for forecasting entire probability distributions of returns across six major equity indices. The LSTM with a skewed Student's t distribution performed best, capturing both heavy tails and asymmetry that simpler models miss. These deep learning forecasts proved competitive with classical GARCH models for Value-at-Risk estimation.

A study from Bayes Business School at City St George's challenges the belief that more complex models are always better. Their "glass-house" machine learning approach—using just one estimator and a few carefully selected, economically meaningful predictors—reduced forecasting error by roughly 30% compared to historical averages. Over longer horizons, errors halved. The lesson: focusing on the right variables matters far more than piling on layers of modeling complexity.

Table 4: Traditional Econometrics vs. Machine Learning — Performance Showdown
Approach	Strengths	Weaknesses	Best Use Case
ARIMA / ARFIMA	Simple, fast, interpretable; strong on linear patterns	Cannot handle nonlinear relationships or regime changes	Short-horizon point forecasts in stable markets
GARCH Family	Excellent at volatility modeling and risk estimation	Mean equation is still linear; needs distributional assumptions	Value-at-Risk, Expected Shortfall, risk budgeting
XGBoost / Random Forest	Learns nonlinear patterns; offers feature importance rankings	Prone to overfitting without careful tuning; less interpretable	Cross-sectional return prediction, factor discovery
LSTM / Deep Learning	Captures long-range dependencies; handles complex sequences	Data-hungry; computationally heavy; black-box nature	Distributional forecasting, regime-adaptive strategies
Hybrid (ARIMA+LSTM / GARCH+XGBoost)	Combines linear rigor with nonlinear flexibility	Complex to build and maintain; higher model risk	Production forecasting systems where accuracy commands a premium

The most impressive results in recent work come from hybrid architectures. A 2025 University of Warsaw study found that the most effective structure combines an econometric ARIMA model with either SVM or LSTM, under the assumption of a non-additive relationship between linear and nonlinear components. These hybrids outperformed both their individual components and a simple buy-and-hold benchmark in trading simulations.

A separate 2025 EGARCH-Informer hybrid for volatility forecasting showed that the econometric layer captures asymmetric volatility dynamics while the attention-based deep learning layer models long-range temporal dependence. At a five-day horizon, the hybrid yielded systematic error reductions of 2–6% over standalone GARCH while maintaining tighter risk calibration. On the emerging-market front, Wavelet-LSTM achieved an out-of-sample directional accuracy of 89.26% on Pakistan's KSE-100 Index, substantially improving over standalone benchmarks.

An algorithmic trading desk in Warsaw ran a live experiment in 2025. They deployed three models side by side on S&P 500 futures: a pure ARIMA, a pure LSTM, and an ARIMA-LSTM hybrid. Over six months, the pure ARIMA produced steady but modest returns. The pure LSTM had higher peaks but deeper drawdowns. The hybrid captured the best of both worlds—matching the LSTM's upside while limiting downside to ARIMA-like levels. The hybrid's Sharpe ratio beat both standalone models by over 30%.

Key-Points

No Single Model Wins Every Time

ARIMA is fast and interpretable but linear. LSTM captures complexity but is data-hungry. Hybrid models combine their strengths and consistently outperform standalone ones in recent research.

The trade-off between interpretability and predictive power is real. If you need to explain your decisions to a client or regulator, a transparent model serves better than a black box.

Model Selection, Validation, and the New Frontier

Building a model is the easy part. Knowing whether it actually works is harder. In-sample performance routinely overstates real-world results. Goyal and Welch famously showed in 2008 that many variables with strong in-sample predictive power for the equity premium failed out of sample, underperforming a simple historical average forecast.

The standard toolkit includes AIC and BIC for penalizing complexity during in-sample comparison. For out-of-sample testing, RMSE and MAE measure forecast accuracy. The Diebold-Mariano test formally compares whether one model's forecasts are statistically better than another's. But standard K-Fold cross-validation can fail with financial time series because data points are not independent. Walk-forward validation respects the temporal order and provides a more honest assessment.

A novel framework published in 2025 proposes using e-values for dynamic model choice in volatility forecasting. E-values provide a valid statistical framework for sequential testing, making them suitable for evaluating model adequacy in real-time settings. They can function as an early warning tool for market instability, linking model miscalibration to the leverage effect and market asymmetries.

At the cutting edge, meta-learning and diffusion models address regime shifts directly. One 2025 framework conditions its forecasts on recent feature-return relationships instead of learning a fixed mapping. During major volatility regime changes on Chinese A-shares and U.S. equities, this approach significantly outperformed standard benchmarks. Transfer learning frameworks have shown that a single global model can be 94% effective at predicting stock returns across multiple countries. The field is not standing still.

Financial econometrics is not about finding the one perfect model. It is about knowing which model fits your data, your horizon, and your purpose—then validating it honestly, with a genuine hold-out period and walk-forward discipline. The tools keep getting better. Using them wisely is the real skill.

Key Takeaways

Table 5: Key Takeaways — Financial Econometrics Modeling Returns
Key Point	What It Means	Action Item
Different models serve different purposes	Asset pricing models explain returns; time series models forecast them; ML finds hidden patterns	Define your goal first (explain vs. predict), then pick the model family that matches it
GARCH variants handle real-world volatility patterns	EGARCH and GJR-GARCH capture asymmetric responses where bad news spikes volatility more than good news	Test for leverage effects in your data. If present, upgrade from plain GARCH to EGARCH or GJR-GARCH
Factor models explain 71-94% of cross-sectional variance	Fama-French 5-factor is powerful but CMA and HML can be redundant in some markets	Periodically test factor redundancy. Do not assume all five factors are pulling their weight
Machine learning finds what traditional models miss	XGBoost and LSTM capture nonlinear relationships but require careful tuning to avoid overfitting	Start with a simple econometric baseline, then test whether ML meaningfully improves out-of-sample results
Hybrid models consistently outperform standalone ones	ARIMA-LSTM and GARCH-XGBoost combinations leverage linear rigor and nonlinear flexibility	If forecast accuracy is business-critical, invest in building and maintaining a hybrid architecture
Out-of-sample validation is non-negotiable	In-sample R² overstates real performance. Walk-forward testing and Diebold-Mariano tests are essential	Always reserve a hold-out period. Never deploy a model based solely on in-sample statistics

Investing During High Inflation Periods

The Workhorses: ARIMA and the GARCH Family

From CAPM to Factor Models: Explaining Why Returns Differ

Machine Learning and Deep Learning Enter the Arena

Model Selection, Validation, and the New Frontier

Key Takeaways

Frequently Asked Questions

Recommended Reading