Skip to main content

Meta-Mining

Some so-called 'internet-prophets' bemoan the increasing volume of web-babble, the deluge of chatter, the hollowness of the information-tsunami. Big words of cultural pessimism that are gratefully picked up by the media.
Those web-critics have a serious problem: they try to *read* all that.
Would they go into a library and start reading the very first book on the shelf? I hope not. When they open Encyclopedia Britannica (yes there are some printed versions around) do they start reading on page 1? Some try to survive in the web by suggesting a new order of information - an ordering according to the date of appearance - the life-streams (see David Gelernter on Edge.org) . This would be an order in time instead of 'space' (where data are conventionally mapped out in different 'locations' on your screen or hard-drive).This approach to clean the data-mess is reminiscent of the cleansing of Augias' stables by diverting the River Alpheus. It's an honorable and classic approach - but does it solve the problem?
Let's look at Twitter. The deafening babble of tweets is already organized in life-streams. Read them live and you will drown.
The solution - besides filtering (friends, topics, lists, labels...) - can not lie in organizing the individual byte-series along one or the other axis (time, space, size, language...), the solution will rather be a mining of the meta-information. If a twitterer posts the unavoidable 'I am off to the loo, be back in a minute', this might only interest the one waiting for a response. If she posts that 20 times a day, we get some additional information: there might be the indication of a physiological problem.
Some meta-mining of tweets is approaching commercial relevance as reported by Jessica Guynn and John Horn in the LA times of April 2, 2010. Computer models based on Twitter chatter, they write, are stunningly accurate in predicting the box-office success of Hollywood movies.
If in the web to be the noise of individual utterances will be systematically analyzed for overlying macro-structures and for phase-transitions from the purely random to the organized, there will be more information gained than individually and knowingly put in. The sheer boundless chatter of Twitter and alike corresponds to the cells, the web is the organism.
If we continue looking at the lion through a microscope, we might get a pretty good understanding of his cells and the breathtaking number of them - but we might miss that we are just about to get eaten.

Comments

Popular posts from this blog

Academics should be blogging? No.

"blogging is quite simply, one of the most important things that an academic should be doing right now" The London School of Economics and Political Science states in one of their, yes, Blogs . It is wrong. The arguments just seem so right: "faster communication of scientific results", "rapid interaction with colleagues" "responsibility to give back results to the public". All nice, all cuddly and warm, all good. But wrong. It might be true for scientoid babble. But this is not how science works.  Scientists usually follow scientific methods to obtain results. They devise, for example, experiments to measure a quantity while keeping the boundary-conditions in a defined range. They do discuss their aims, problems, techniques, preliminary results with colleagues - they talk about deviations and errors, successes and failures. But they don't do that wikipedia-style by asking anybody for an opinion . Scientific discussion needs a set

Left Brain, Right Brain

At a wonderful summer night I was lying in the grass, my little son beside me. We were staring into the dark sky, debating infinity, other planets, the origin of everything, observing falling stars that were whizzing through the atmosphere at a delightfully high rate. Why did we see so many of them that night? What are falling stars? What are comets. Why do comets return and when? The air was clear and warm. No artificial lights anywhere. The moon was lingering lazy in the trees across the river. Some fireflies were having a good time, switching their glow on and off rather randomly - in one group they seemed to synchronize but then it was random again. It reappeared: a few bugs were flashing simultaneously at first ... it started to expand, it was getting more. A whole cloud of insects was flashing in tune. Are they doing this on purpose? Do they have a will to turn the light on and off? How do those fireflies communicate? And why? Do they communicate at all? My son pointed at a fie

My guinea pig wants beer!

Rather involuntary train rides (especially long ones, going to boring places for a boring event) are good for updates on some thoughts lingering in the lower levels of the brain-at-ease. My latest trip (from Berlin to Bonn) unearthed the never-ending squabble about the elusive 'free will'. Neuroscientists make headlines proving with alacrity the absence of free will by experimenting with brain-signals that precede the apparent willful act - by as much as seven seconds! Measuring brain-activity way before the human guinea pig actually presses a button with whatever hand or finger he desires, they predict with breathtaking reproducibility the choice to be made. So what? Is that the end of free will? I am afraid that those neuroscientists would accept only non-predictability as a definite sign of free will. But non-predictability results from two possible scenarios: a) a random event (without a cause) b) an event triggered by something outside of the system (but caused).