Exploratory Data Analysis
Do It All Again … (or at least be able to)
Do Use The Same Problems
- Reproducibility is key to science (cf. cold fusion)
- Being able to do it all again makes reproducibility possible
- e.g. storing random seeds used in experiments
- We didn’t do that and might have lost an important result
- Being paranoid allows health-checking
- e.g. confirm that ‘minor’ code changes do not change results
- “identical” implementations in C, Scheme, C, gave different results
- Using the same problems for every method being compared can reduce variance
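
The points above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the setup used in the original experiments: `make_problems`, `solver_a`, `solver_b`, the seed values, and the problem sizes are all hypothetical stand-ins. It shows (1) storing the seed next to the results so a run can be repeated, (2) a paranoid check that a re-run reproduces the stored results exactly, and (3) how comparing two methods on the same problems gives a much smaller spread in the measured difference than comparing them on different problems.

```python
import json
import random
import statistics


def make_problems(seed, n_problems=20, size=50):
    """Build a benchmark set; the same seed always yields the same problems."""
    rng = random.Random(seed)  # private generator, unaffected by global state
    return [[rng.random() for _ in range(size)] for _ in range(n_problems)]


def solver_a(problem):
    # Hypothetical stand-in for a real algorithm: its "cost" on a problem.
    return sum(problem)


def solver_b(problem):
    # Hypothetical variant that is about 2% better on every problem.
    return 0.98 * sum(problem)


def run_experiment(seed):
    """Run both solvers on one problem set and keep the seed with the results."""
    problems = make_problems(seed)
    return {
        "seed": seed,
        "a": [solver_a(p) for p in problems],
        "b": [solver_b(p) for p in problems],
    }


def paired_spread(seed):
    """Std. dev. of (a - b) when both solvers see the same problems."""
    r = run_experiment(seed)
    return statistics.stdev([x - y for x, y in zip(r["a"], r["b"])])


def unpaired_spread(seed_a, seed_b):
    """Std. dev. of (a - b) when each solver sees different problems."""
    a = [solver_a(p) for p in make_problems(seed_a)]
    b = [solver_b(p) for p in make_problems(seed_b)]
    return statistics.stdev([x - y for x, y in zip(a, b)])


# 1. Store the seed together with the results (here as a JSON string).
record = json.dumps(run_experiment(seed=12345))

# 2. Health check: re-running from the stored seed must give identical results.
saved = json.loads(record)
rerun = run_experiment(saved["seed"])
print("reproduced exactly:", rerun == saved)

# 3. Same problems versus different problems: compare the spread of a - b.
print("paired spread:  ", paired_spread(12345))
print("unpaired spread:", unpaired_spread(12345, 54321))
```

The same re-run check can be repeated after any 'minor' code change: if the new results differ from the stored record, the change was not as minor as it looked.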