4 Justification to Use Machine Learning

We have demonstrated that prediction is an essential goal in science. Great! But how does machine learning help scientists make predictions? And, what value does it add to existing predictive modeling approaches? What are good reasons for using machine learning in science?

In this chapter, we justify the use of machine learning in science both epistemically and pragmatically.

With the blessing of the Elder Council, tornado prediction became the first machine learning project in Raven Science. Rattle teamed up with the tornado expert Krarah. Krarah was responsible for the data, and Rattle for the training and modeling. Within one month, they had created a model that outperformed all previous tornado prediction models. Eureka! A new paradigm was born.

4.1 Machine Learning has a clear notion of success

In science, the quality of models is often assessed in a lengthy and non-transparent manner. Social aspects such as the reputation of the researchers who developed the model, the opinion of other scientific authorities, or the amount of funding for the research program can play as important a role in model assessment as the predictive capacity of the model [1].

In comparison, machine learning offers a very transparent notion to assess model quality: a good model is one that has a low average prediction error on an unseen test dataset. The idea behind this assessment is that a model with a low empirical error on a test set is expected to make few errors on the task to be solved. This inference from the test to the so-called generalization error is even demonstrably correct if the test data is a random and representative data sample of the task.

Furthermore, not only is the goal of modeling very transparent, but the feedback that researchers receive is also quick to obtain and comes in quantified form. In this way, scientists can check their modeling intuitions throughout the modeling process on an almost purely empirical basis and even compare their models with entirely different modeling schools.

This transparency about what makes a good model enables scientists to work towards a common goal. We discuss the concept of generalization in Chapter 7.

4.2 Machine learning adapts the model to the world

Most approaches where mathematical models are fit to data, for example, statistical models or simulations, come with a constrained functional form. Like assuming a linear, exponential, inverse, or additive relationship between variables. Such constraints usually reflect the domain knowledge of the researcher. For example, if you know that a certain fertilizer is less effective in dry climates, you may add an interaction term.

While incorporating domain knowledge sounds smart, it can be dangerous: Your modeling success will depend on the reliability of your domain knowledge and your skill in encoding it. A researcher who only knows linear models will assume linearity in all contexts, whether it is a conscious choice or not. Misspecified models will lead to wrong conclusions and therefore to bad science [2].

In machine learning, the model classes – like neural networks or random forests – are more flexible and powerful. They can, in principle, approximate any function.¹ You don’t need to know that there is an interaction between humidity and fertilizer effectiveness to build a good model. The algorithm will learn this interaction from the data. There is a shift in burden, the modeling success rests not on your domain knowledge anymore but on the size and quality of your data. Equipped with enough data, even non-domain experts can find highly predictive models. Indeed, there is a trade-off: with lots of data, domain knowledge is secondary; with little data, on the other hand, domain knowledge is key.

We will discuss the role of domain knowledge and how to incorporate it in machine learning modeling in Chapter 8.

4.3 Machine learning handles various data structures

The more complex the data structures become, the more difficult it becomes to make assumptions and follow a traditional modeling approach. This becomes evident if we leave the realm of tabular data and consider images, text, and geospatial data. This is where machine learning can shine. Machine learning successfully handles various data types, such as tabular [5], image [6], geospatial [7], sound [8], graphs [9], or text data [10]. There is often lots of data, but it’s not always in perfect shape:

Most data wasn’t produced by controlled experiments but comes from the messy world.
Data is often high-dimensional without indication of what’s relevant and what’s not.
Data can come in various forms (text, images, audio, etc.) and is not restricted to tables with clearly interpretable columns.

Unlike approaches such as classic statistical modeling, machine learning can deal with these problems. All you need is data! Well, we exaggerate a bit here, but you get the idea. Dealing with diverse data is not merely practical but also epistemic. If diverse data sources point to similar insights, this strongly supports the validity of these insights [11].

4.4 Machine learning allows you to work on new questions

Machine learning has enabled scientists to leverage data to solve specific problems that they hadn’t been able to “solve” before using other approaches:

Translating texts: Machine translation is older than machine learning, but, let’s be honest, it was too bad to be useful. Before machine learning was mature enough and embraced, rule-based systems and statistical machine translation were used.
Diagnosis of diseases based on X-ray images: Before machine learning, techniques like template matching and rule-based algorithms were used. Only machine learning made it possible to make huge progress in this task.
Drug discovery: Traditionally, drug discovery involved a lot of trial and error. Machine learning has made it possible to analyze huge amounts of data and predict how different drugs might interact with targets in the body. In the future, this may greatly accelerate the process of drug discovery.

For such research questions, machine learning is finally offering routes toward studying them systematically, built on empirical data. Paraphrasing Wittgenstein – the limits of my modeling are the limits of my world. This implies that extending the limits of modeling with machine learning extends the realm of phenomena scientists can investigate.

4.5 Don’t undervalue machine learning because it is new

Compared to other modeling approaches, machine learning has a short history. Differential equations or linear models, for instance, have existed for hundreds of years. Historically, new modeling approaches like machine learning always had a hard time establishing themselves:

Einstein’s theory of relativity relied on non-Euclidean geometry – an unusual branch of mathematics for these endeavors – which was one of the reasons why he was met with skepticism.
Probabilistic models were first completely rejected in linguistics due to the dominance of logical methods. Today, they are well-established.
In econometrics, graphical causal models have a hard time asserting themselves against the prevailing framework of potential outcomes.

The novelty of an approach should neither be taken as an argument in favor nor against using a certain modeling approach. The essential question is: Can the tool help answer the question you have set out to address?

4.6 Further justifications for machine learning

There are further justifications for using machine learning in science:

Time efficiency: Accurate machine learning models can often be built time-efficiently by researchers. There is less pre-processing needed compared to other approaches and large parts of the model selection and training process can be automated. Also, there is great support for machine learning packages in popular programming languages like Python (e.g., scikit-learn, PyTorch, or TensorFlow) or R (e.g., mlr3 or tidymodels).
Computational efficiency: In some situations, machine learning can be computationally cheaper than traditional modeling. Machine learning models can, for example, be used as fast surrogate models for simulations from physics or material science that are computationally very intense [12]. Machine learning weather prediction can run on your laptop, unlike numerical methods that demand the world’s biggest computing clusters [13].
Basis for theory: Machine learning may support you with the knowledge needed to build classical theory-based models. They may show which features contain predictive information and how features interact.
Effective for operationalized goals: Machine learning is ideal if the aim can be easily encoded in a single metric. If you are confident in your metric, machine learning will provide you with the means to optimize for it.

[1]

T. S. Kuhn, The structure of scientific revolutions, vol. 962. University of Chicago press Chicago, 1997.

[2]

B. Dennis, J. M. Ponciano, M. L. Taper, and S. R. Lele, “Errors in statistical inference under model misspecification: Evidence, hypothesis testing, and AIC,” Frontiers in Ecology and Evolution, vol. 7, p. 372, 2019, doi: 10.3389/fevo.2019.00372.

[3]

K. Hornik, “Approximation capabilities of multilayer feedforward networks,” Neural networks, vol. 4, no. 2, pp. 251–257, 1991, doi: 10.1016/0893-6080(91)90009-T.

[4]

Y. Lu and J. Lu, “A universal approximation theorem of deep neural networks for expressing probability distributions,” Advances in neural information processing systems, vol. 33, pp. 3094–3105, 2020, doi: 10.5555/3495724.3495984.

[5]

Y. Gao et al., “Machine learning based early warning system enables accurate mortality risk prediction for COVID-19,” Nature communications, vol. 11, no. 1, p. 5033, 2020, doi: 10.5281/zenodo.3991113.

[6]

B. J. Erickson, P. Korfiatis, Z. Akkus, and T. L. Kline, “Machine learning for medical imaging,” Radiographics, vol. 37, no. 2, pp. 505–515, 2017, doi: 10.1148/rg.2017160130.

[7]

X. Ren et al., “Deep learning-based weather prediction: A survey,” Big Data Research, vol. 23, p. 100178, 2021, doi: 10.1016/j.bdr.2020.100178.

[8]

T. Oikarinen et al., “Deep convolutional network for animal sound classification and source attribution using dual audio recordings,” The Journal of the Acoustical Society of America, vol. 145, no. 2, pp. 654–662, 2019, doi: 10.1121/1.5097583.

[9]

W. Hu et al., “Open graph benchmark: Datasets for machine learning on graphs,” Advances in neural information processing systems, vol. 33, pp. 22118–22133, 2020, doi: doi/10.5555/3495724.3497579.

[10]

M. Neethu and R. Rajasree, “Sentiment analysis in twitter using machine learning techniques,” in 2013 fourth international conference on computing, communications and networking technologies (ICCCNT), IEEE, 2013, pp. 1–5. doi: 10.1109/ICCCNT.2013.6726818.

[11]

J. Earman, “Bayes or bust? A critical examination of bayesian confirmation theory.” MIT Press, 1992.

[12]

J. Q. Toledo-Marı́n, G. Fox, J. P. Sluka, and J. A. Glazier, “Deep learning approaches to surrogates for solving the diffusion equation for mechanistic real-world simulations,” Frontiers in Physiology, vol. 12, p. 667828, 2021, doi: 10.3389/fphys.2021.667828.

[13]

R. Lam et al., “Learning skillful medium-range global weather forecasting,” Science, p. eadi2336, 2023, doi: 10.1126/science.adi2336.

Many hypotheses classes in machine learning are universal approximators of some form; that is, they lie dense within the space of (continuous/step) functions [3], [4]. While this is not a super special property, e.g., also polynomials lie dense in the space of continuous functions, it at least expresses that there is in principle a model that approximates any possible relationship well. This is different compared to classical statistical models that rely on small model classes; even the best model within such a class may be overall a bad model to capture the true relationship between predictors and target.↩︎