Correlation between Recall Probability Estimation Error and Sample Size

Under ideal conditions where recall outcomes are binary (remembered or forgotten), the distribution follows a binomial model. Utilizing frequency to estimate probability yields an unbiased estimate, with the primary source of error stemming from the variance of the binomial distribution.

$Var(X) = N \cdot p \cdot (1-p)$
- X represents the number of successes
- N denotes the total number of trials
- p denotes the true probability of success
The average variance per experiment is $\cfrac{p(1 - p)}{N}$
The standard deviation can be used to estimate the magnitude of error: $\sqrt{\frac{p(1-p)}{N}}$
In the worst-case scenario, where p equals 0.5, the error is $\cfrac{1}{2\sqrt{N}}$

To achieve an error margin of 0.01 when p = 0.5, at least 2,500 trials are needed.