How It Works
Explore the mathematics and methodology underpinning our v0 predictive model.
1. Wilson Confidence Interval
In sports statistics, a very common error is to overestimate a player's success rate based on too few matches. For example, if a player loses Set 1 and comes back to win in the only recorded match, their simple "comeback rate" would be 100%. Obviously, this is unrealistic.
To correct this small-sample bias, PredictPal calculates the 95% Wilson Confidence Intervals (Score Interval). The formula for the lower ($L$) and upper ($U$) bounds of the proportion is:
Where $\hat p$ is the observed rate (e.g., $k/n$), $n$ is the sample size (e.g., matches where Set 1 was lost), and $z$ is the quantile of the standard normal distribution ($z = 1.96$ for a 95% confidence level).
If a player has few samples ($n$), the Wilson interval widens significantly, alerting us visually to the metric's high uncertainty.
2. Resolution Chain (Lookup Chain)
To guarantee robust estimations, we implement a cascading fallback hierarchy that prioritizes information with the highest statistical relevance:
- 1Matchup History (H2H): If both players have at least 10 recorded matches under the current conditional state, the direct conditional probability of that matchup is used along with its respective Wilson Interval.
- 2Player Average: If the head-to-head matchup has insufficient samples, the model falls back to evaluate the conditional performance of each player separately against the entire league (provided they have played at least 30 matches overall).
- 3League Average: If the player is new or has very few samples in the system, we apply the average transition rates of the entire league (Setka Cup or TT Elite Series).
- 4Constant Value (50%): As an absolute last resort if no league data is available, pure chance is assumed (0.5).
3. Calibration and Brier Score
To audit the quality of our conditional probabilities, we automatically calculate the Brier Score (BS) metric for every set state prediction (Sets 1+2). The Brier Score is defined as:
Where $f_i$ is the conditional probability assigned by our model to the home player's victory, and $o_i$ is the observed binary outcome ($o_i = 1$ if the home player wins the match, $o_i = 0$ if they lose).
A Brier Score of 0.0 represents a perfect prediction (absolute certainty matching the outcome). A model that consistently assigns a 50% probability (pure chance) will get a Brier Score of 0.250. The PredictPal v0 model operates within an optimal calibration range of 0.150 – 0.165, demonstrating a robust predictive edge over chance.