next up previous
Next: Hints for Using Program Up: Appendix: Some Advice for Previous: Appendix: Some Advice for

Errors in MC and MD

Williams and Holland's Law:

If enough data is collected, anything may be proven by statistical methods.

Suppose we are analyzing the set of configurations, produced in either MC or MD simulations. The total number of configurations is $\tau_{\text{run}}$ The time average of a property A is then
\begin{displaymath}
\langle A\rangle=\frac{1}{\tau_{\text{run}}}\sum_{\tau=1,\tau_{\text{run}}} A(\tau)\end{displaymath} (1)
If each $A(\tau)$ were statistically independent of the others, the variance of the mean would simply be:
\begin{displaymath}
\begin{aligned}
\sigma^2(\langle A\rangle_{\text{run}}) =& \...
 ...1,\tau_{\text{run}}}(A(\tau) - \langle A\rangle)^2\end{aligned}\end{displaymath} (2)
and $\sigma(\langle A\rangle_{\text{run}})$ gives the estimated error in average. But usually the data produce in MD and MC simulations are correlated. As a limiting case we can assume, that our set of $\tau_{\text{run}}$ configurations actually consists of blocks of $\tau_A$identical values (Jacucci and Rahman, 1984, $\tau_A$ corresponds to the correlation time in the MD simulations). Then
\begin{displaymath}
\sigma^2(\langle A\rangle_{\text{run}}) = 2\tau_A \sigma^2(A)/\tau_{\text{run}}\end{displaymath} (3)
To estimate the real error bars we split our data in blocks, varying the block length $\tau_A$, calculate the block averages $\langle A\rangle_b=1/\tau_A \sum_{\tau=1,\tau_A}A(\tau)$ and then estimate the variance, using these block averages:
\begin{displaymath}
\sigma^2(\langle A\rangle_b ) =
\frac{\tau_A}{\tau_{\text{run}}}\sum(\langle A\rangle_b-\langle A\rangle_{\text{run}})^2 \end{displaymath} (4)
As the block size increases, block averages become less and less correlated, and $\sigma^2(\langle A\rangle_b )$ becomes proportional to the number of blocks, or inversely proportional to $\tau_A$. This means, that the value $S = \tau_A \sigma^2(\langle A\rangle_b )/\sigma^2(A)$ becomes independent of $\tau_A$. Each S-th configuration of all produced can be taken as a set of uncorrelated configurations. Thus the error estimate should be based on the $\tau_{\text{run}}/S$ configurations, and not $\tau_{\text{run}}$ itself. That is why efficient MC algorithms are developed to produced less correlated configurations in the same total number of MC steps.


next up previous
Next: Hints for Using Program Up: Appendix: Some Advice for Previous: Appendix: Some Advice for

© 1997 Boris Veytsman and Michael Kotelyanskii
Tue Dec 2 20:15:24 EST 1997