- Markus Breitenbach - http://blog.markus-breitenbach.com -

The cloud obscuring the scientific method

Posted By Markus On July 12, 2008 4:41 pm @ 4:41 pm (July 12, 2008) In Machine Learning, Artificial Intelligence (AI) | No Comments

“All models are wrong, and increasingly you can succeed without them” — George Box
Sometimes…” — Me

In a [1] Wired article about the Peta-byte age of data processing the author claimed that given the enormous amounts of data and the patterns found by data mining we are less and less dependent on scientific theory. This has been strongly disputed (see [2] Why the cloud cannot obscure the Scientific Method) as the author simply ignores the fact that all the patterns that were found are not necessarily exploitable - finding a group of genes that interact is a first step, but won’t cure cancer. However, in machine translation or placing advertising online one can succeed with little to no domain knowledge. That is, once somebody comes up with the right features to use (see [3] Choosing the right features for Data Mining).

What would be interesting to develop, however, is a “meta-learning” algorithm that can abstract from simpler models and learn e.g. a differential equation. For example, lets take data from several hundred Physics experiments about heat-distribution conducted on different surfaces etc. We can probably learn a regression model for one particular experiment which could predict how the heat will distribute given the parameters of the experiment (material, surface etc.). The meta-learning algorithm would then look at these models and somehow come up with the [4] heat-equation. That would be something…


Article printed from Markus Breitenbach: http://blog.markus-breitenbach.com

URL to article: http://blog.markus-breitenbach.com/2008/07/12/the-cloud-obscuring-the-scientific-method/

URLs in this post:
[1] Wired article about the Peta-byte age of data processing: http://www.wired.com/science/discoveries/magazine/16-07/pb_theory
[2] Why the cloud cannot obscure the Scientific Method: http://arstechnica.com/news.ars/post/20080625-why-the-cloud-cannot-obscure-the-s
cientific-method.html

[3] Choosing the right features for Data Mining: http://blog.markus-breitenbach.com/2007/06/01/choosing-the-right-features-for-da
ta-mining/

[4] heat-equation: http://en.wikipedia.org/wiki/Heat_equation

Click here to print.