Statistical Validity in Big Data

…there are vastly more possible comparisons than there are data points to compare. Without careful analysis, the ratio of genuine patterns to spurious patterns – of signal to noise – quickly tends to zero.

Tim Harford has a great article in the Financial Times called "Big data: are we making a big mistake?"
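
The point in the quote above can be seen in a small simulation. The sketch below is not from Harford's article; it is a minimal illustration under assumed settings. It generates columns of pure random noise, compares every pair, and counts how many pairs look correlated anyway. The helper names (`pearson`, `count_spurious`), the 20 data points, and the 0.5 cutoff are all arbitrary choices made for the example.

```python
import itertools
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def count_spurious(n_points, n_variables, threshold, seed=0):
    """Count pairs of independent noise variables with |r| above threshold.

    Every column is independent Gaussian noise, so any pair that clears
    the threshold is spurious by construction.
    """
    rng = random.Random(seed)
    columns = [
        [rng.gauss(0, 1) for _ in range(n_points)]
        for _ in range(n_variables)
    ]
    pairs = list(itertools.combinations(columns, 2))
    hits = sum(1 for x, y in pairs if abs(pearson(x, y)) > threshold)
    return hits, len(pairs)


if __name__ == "__main__":
    # Hypothetical settings: 20 data points, a growing number of variables.
    for n_variables in (10, 50, 200):
        hits, total = count_spurious(
            n_points=20, n_variables=n_variables, threshold=0.5
        )
        print(f"{n_variables} variables: {total} comparisons, "
              f"{hits} exceed |r| > 0.5")
```

The number of data points stays at 20 throughout, but the number of pairwise comparisons grows from 45 to 1,225 to 19,900 as variables are added. Since there is no real signal in the data, every "correlation" the script reports is noise, and the count of such false patterns rises with the number of comparisons.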

