Skip to main content

Don't call Big Data a Revolution

Everybody in science seems to love Big Data. Put "Big Data" in your grant proposal and your file gets on top of the pile. Sure, some had the suspicion that funding for operating with big data went up because those nerds in the basement of NSA need some help sifting through cassettes of indiscriminate tapping into every utterance of every two-legged creature on earth. Those losers obviously lack the brains to ask the right questions and to target a reasonable subset of mankind - so they just grab everything they get. And stay as blind as they were before. Of course it is difficult to find a needle in a haystack - but why dump all that hay on the needle in the first place?
This aside, there are believers like Kenneth Cukier and Viktor Mayer-Schoenberger, authors of "Big Data: A Revolution that will transform how we live, work and think" (Houghton Mifflin Harcourt, 2013), who marvel at the transition from trying to approach a mechanism in nature with smart experiments to prediciting the future behaviour of the system by merely observing and describing patterns. They call the interest in correlation (and not causation) a paradigm-shift, a revolution and nothing less than the future of science at large.
They are probably right.
And this is scary.
If you just want to get an idea of potential traffic jams depending on location and time of day, big data might help. If you need to know if your medication cures or kills, excellent statistics will do. Big data ultimately brings you from statistics of small numbers to 'N=ALL'. 
The main drive of science always was - and always will be - curiosity for the mechanism, the 'why?'.
Recording huge amounts of data - all data available - does not solve any problems. In the worst case it substitutes understanding with describing.
But in the best case, the mapping of a system on as many related data as possible can be seen as lifting it from nature to the lab. The really Big Data that contain *all* correlations would be a transposition of the real thing that then can be experimented on. See Big Data as the score-sheet to a symphony, plus information on the instruments, plus the acoustics, plus the musicians, plus the atmosphere, plus...
(I am off to the lab)


Sandor Ragaly said…
fantastic post - for *small* is smart (and beautiful also, partially :-) ).

Popular posts from this blog

Academics should be blogging? No.

"blogging is quite simply, one of the most important things that an academic should be doing right now" The London School of Economics and Political Science states in one of their, yes, Blogs . It is wrong. The arguments just seem so right: "faster communication of scientific results", "rapid interaction with colleagues" "responsibility to give back results to the public". All nice, all cuddly and warm, all good. But wrong. It might be true for scientoid babble. But this is not how science works.  Scientists usually follow scientific methods to obtain results. They devise, for example, experiments to measure a quantity while keeping the boundary-conditions in a defined range. They do discuss their aims, problems, techniques, preliminary results with colleagues - they talk about deviations and errors, successes and failures. But they don't do that wikipedia-style by asking anybody for an opinion . Scientific discussion needs a set

Left Brain, Right Brain

At a wonderful summer night I was lying in the grass, my little son beside me. We were staring into the dark sky, debating infinity, other planets, the origin of everything, observing falling stars that were whizzing through the atmosphere at a delightfully high rate. Why did we see so many of them that night? What are falling stars? What are comets. Why do comets return and when? The air was clear and warm. No artificial lights anywhere. The moon was lingering lazy in the trees across the river. Some fireflies were having a good time, switching their glow on and off rather randomly - in one group they seemed to synchronize but then it was random again. It reappeared: a few bugs were flashing simultaneously at first ... it started to expand, it was getting more. A whole cloud of insects was flashing in tune. Are they doing this on purpose? Do they have a will to turn the light on and off? How do those fireflies communicate? And why? Do they communicate at all? My son pointed at a fie

My guinea pig wants beer!

Rather involuntary train rides (especially long ones, going to boring places for a boring event) are good for updates on some thoughts lingering in the lower levels of the brain-at-ease. My latest trip (from Berlin to Bonn) unearthed the never-ending squabble about the elusive 'free will'. Neuroscientists make headlines proving with alacrity the absence of free will by experimenting with brain-signals that precede the apparent willful act - by as much as seven seconds! Measuring brain-activity way before the human guinea pig actually presses a button with whatever hand or finger he desires, they predict with breathtaking reproducibility the choice to be made. So what? Is that the end of free will? I am afraid that those neuroscientists would accept only non-predictability as a definite sign of free will. But non-predictability results from two possible scenarios: a) a random event (without a cause) b) an event triggered by something outside of the system (but caused).