How to Judge if a Time Series Is a Martingale? Using the Black–Scholes Model as an Explanation
How to Judge if a Time Series Is a Martingale? Using the Black–Scholes Model as an Explanation
Table of Contents
- 1. Introduction
- 2. The Black–Scholes Model and Geometric Brownian Motion
- 3. Simulation of Multiple Traces
- 4. Advanced Time Series Analysis
- 5. Visualization and Interpretation
- 6. Conclusion
- Appendix A: Welch’s Method and PSD Derivation
- Appendix B: Derivation of the Jensen–Shannon Divergence
- Appendix C: Martingale Process
1. Introduction
The Black–Scholes model, famous for its role in option pricing, assumes that asset prices follow a geometric Brownian motion (GBM). A key property of GBM—and martingales in general—is that the conditional expectation of the future value, given the current information, equals the current value. By simulating asset price paths using the Black–Scholes model, we can study their statistical properties to assess whether they exhibit martingale-like behavior.
In our analysis, we simulate multiple asset price paths over a five-year period and then examine them using two advanced tools:
- Complexity–Entropy Causality Plane: This technique combines permutation entropy and statistical complexity to capture both randomness and hidden structure in the time series.
- Power Spectral Density (PSD) Analysis: Using Welch’s method, we estimate how the variance of the series is distributed across frequencies, revealing characteristic power–law behavior.
2. The Black–Scholes Model and Geometric Brownian Motion
2.1. Stochastic Differential Equation
The Black–Scholes model describes the evolution of an asset price with the stochastic differential equation (SDE):
where:
- is the drift rate,
- is the volatility, and
- is the increment of a standard Brownian motion.
2.2. Derivation via Itô’s Lemma
To solve the SDE, we define and apply Itô’s lemma. This gives:
Substituting the expression for and noting that , we obtain:
which simplifies to
Integrating from to , we have:
Exponentiating yields the solution:
2.3. Discrete Simulation
In practice, we simulate the process by discretizing time. Over a small interval , the discrete model is given by:
with . This recursive formula allows us to generate asset price paths.
3. Simulation of Multiple Traces
For our analysis, we simulate three independent Black–Scholes paths over a five-year period. Assuming approximately 252 trading days per year, this results in around 1,260 time steps per simulation. These multiple traces allow us to compare the dynamics across different realizations and study their collective behavior.
4. Advanced Time Series Analysis
After generating our simulated asset price paths, we apply two advanced analytical techniques to judge their martingale properties.
4.1. Complexity–Entropy Causality Plane
The Complexity–Entropy (CH) causality plane is a tool that plots two key quantities calculated from a time series:
-
Permutation Entropy :
This measure quantifies the randomness in the ordering of values by examining the frequency of ordinal patterns. For a chosen embedding dimension , if is the probability of an ordinal pattern , then:It is often normalized by dividing by , yielding:
Example of Computing :
Consider a short time series: with and .- The first window, , has the ordinal pattern , because the smallest value is , followed by , and then .
- The next window, , is in increasing order, yielding the ordinal pattern .
If these are the only windows, then the probability distribution is:
with zero probability for all other patterns.
-
Statistical Complexity :
This measure combines the normalized entropy with the Jensen–Shannon divergence between the observed distribution and the uniform distribution . The explicit formula is:where is the Jensen–Shannon divergence, detailed in Appendix B.
Plotting against provides a visual representation of how the time series balances randomness with structure—a key indicator of martingale behavior.
4.2. Power Spectral Density Analysis
The PSD of a time series shows how its variance is distributed over frequency. Many stochastic processes follow a power–law:
where is the spectral exponent. For Brownian motion, typically .
To estimate the PSD, Welch’s method is used. This technique:
- Divides the time series into overlapping segments.
- Applies a window function to each segment to minimize spectral leakage.
- Computes the periodogram (squared magnitude of the Fourier transform) for each segment.
- Averages the periodograms to produce a robust PSD estimate.
A linear regression on the log–log plot of the PSD yields an estimate of , which provides further evidence of martingale behavior if it aligns with theoretical expectations.
5. Visualization and Interpretation
In our analysis:
- The left panel displays the three simulated price paths over five years in distinct colors.
- The top-right panel shows the CH–plane trajectories for each trace. Instead of a single point, the accumulated history of and is plotted, revealing how these metrics evolve over time.
- The bottom-right panel presents the PSD spectrum for one simulation on a log–log scale, along with a fitted line and the estimated exponent .
These visualizations help determine whether the time series behaves as a martingale by revealing key characteristics such as the lack of exploitable structure (in the CH–plane) and a PSD exponent consistent with theoretical models.
6. Conclusion
By simulating asset price paths with the Black–Scholes model and analyzing them with the Complexity–Entropy causality plane and PSD estimation via Welch’s method, we can assess whether a time series behaves like a martingale. A martingale process, by definition, has no predictable trends beyond its current value. When the CH–plane indicates high normalized entropy and low statistical complexity, and the PSD exhibits a power–law with an exponent consistent with Brownian motion, these are strong indicators of martingale-like behavior.
Appendix A: Welch’s Method and PSD Derivation
A.1. Welch’s Method
Welch’s method refines the periodogram approach to PSD estimation by reducing variance. The procedure involves:
-
Segmenting the Data:
The time series is divided into overlapping segments (commonly with 50% overlap). -
Windowing:
Each segment is multiplied by a window function (e.g., Hann or Hamming) to reduce spectral leakage. -
Computing Periodograms:
For each windowed segment, the discrete Fourier transform (DFT) is computed, and the squared magnitude yields the periodogram. -
Averaging:
The periodograms are averaged to produce the final PSD estimate:where is the periodogram of the th segment and is the number of segments.
A.2. PSD Derivation
For a stationary process , the theoretical PSD is defined by:
For a finite segment of length with window and sampling interval , the periodogram is given by:
where the normalization factor is:
Averaging over segments yields the PSD estimate.
Appendix B: Derivation of the Jensen–Shannon Divergence
B.1. Definition and Derivation
The Jensen–Shannon divergence (JSD) is a symmetrized and smoothed version of the Kullback–Leibler (KL) divergence. For two probability distributions and , the KL divergence is:
However, the KL divergence is asymmetric and can be infinite if for any where . The JSD is defined as:
where the mixture distribution is:
This definition ensures symmetry and finiteness.
B.2. Explicit Expression for
When comparing a distribution with the uniform distribution (where for outcomes), the JSD becomes:
B.3. Simple Example
Consider a probability distribution over 3 outcomes. The uniform distribution is:
-
Compute the Mixture:
For each outcome , the mixture is:For :
For :
For :
-
Compute the KL Divergences:
Calculate:and
-
Jensen–Shannon Divergence:
Finally, the divergence is:
Appendix C: Martingale Process
C.1. Filtration
A filtration is a family of sigma–algebras
which represents the information available up to each time ( t ). It is an increasing family, meaning that for any ( 0 \le s \le t ),
Intuitively, (\mathcal{F}_t) includes all events (or outcomes) that have occurred by time ( t ). For example, if you are observing the price of a stock over time, (\mathcal{F}_t) would consist of all the historical data, news, and any other relevant information known up to time ( t ).
C.2. Martingale Definition
A stochastic process ( (M_t)_{t \ge 0} ) is called a martingale with respect to the filtration ( (\mathcal{F}t){t \ge 0} ) if it satisfies the following conditions:
-
Integrability:
-
Martingale Property:
For any ( 0 \le s < t ),
This definition captures the essence of a martingale: given all the information available at time ( s ), the best prediction for the value at time ( t ) is the current value ( M_s ). In other words, there is no predictable trend or “drift” in the process.
C.3. A Simple Example: Fair Coin Toss Random Walk
Consider a simple game based on tossing a fair coin. Define a sequence of random variables ( X_1, X_2, \dots ) by
Since the coin is fair, each ( X_i ) has an expected value of zero:
Now, define the cumulative sum (or simple random walk) by
Let the filtration ( {\mathcal{F}n}{n \ge 0} ) be the sigma–algebra generated by the outcomes of the first ( n ) tosses. Then, for any ( n ), the martingale property is verified as follows: