Observations and reanalayses: Our shaky reference

For everyone working on data analysis in climatological science, using references is essential. These references, representing some form of truth, is often the target, which models have to reach. Verification (or in non-meteorological science validation) methodologies evaluate the results against the references and dependent on the methodology deliver good results when the model is near to it, matches its variability or is close in other statistical parameters. The power of these references in these analysis and defining our knowledge about the world is immense and so it is essential that it really has something to do with things we see in front of our windows.

Last month Wendy Parker published a paper named “Reanalyses and Observations: What’s the Difference” and looked at the references from a more philosophical point of view. She listed four points, which critically looked at the connection between references and observations and in this post I would like to take a look at them.

Continue reading


Big Data – More risks than chances?

There is an elephant in the room, at every conference in nearly every discipline. The elephant is so extraordinary that everyone seems to want to watch and hype it. In all this trouble a lot of common sense seems to get lost and especially the little mice, who are creeping around the corners, overlooked.

The big topic is Big Data, the next big thing that will revolutionise society, at least when you believe the advertisements. The topic grew in the past few years into something really big, especially as the opportunities of this term are regularly demonstrated by social media companies. Funding agencies and governments have seen this and put Big Data at their top of their science agenda. A consequence are masses of scientist, sitting in conference sessions about Big Data and discussions vary between the question on what it is and how it can be used. Nevertheless, there are a lot of traps in this field, who might have serious consequences for science in general. Continue reading

The role of statistics in science

Traditionally within the different disciplines of earth science the scientists are divided into two groups: modelers and observationalists. In this view the modellers are those who do theory, possibly with pen and paper alone, and the observationalist go into the field and get dirty hands. That this view is a little bit outdated, won’t be anything new. In my opinion, it really started with the establishment of remote sensing that this division reunited (Yes, reunite, because in the old days, there were a lot of scientists who did everything). As I am a learned meteorologist, from my view it is quite common that this division is not really existent anymore. Both types of scientists sit in front of their computer, both are programming and both have to write papers with a lot of mathematical equations. In other fields, the division might be still more obvious (e.g. Geology), but for many its only the type of data someone is working with, which classify someone as observationalist or modeller. Continue reading

The sampling issue

Observations are generally a tricky thing. Not only are they a special kind of model, which tries to cover a sometimes very complicate laboratory experiment. Additionally they are also representing the truth, as far as we are able to measure it. As a consequence they play a really important part in science, but are in some fields hard to generate.

During the PALSEA2 meeting a question has come up in the context of the generation of paleo-climatic sea-level observations.

Assumed your ressources allow only two measurements, is it better when they be near towards each other or should they be far away.

In the heat of the discussion both sides were taken, but in the end the conclusion was the typical answer for such kind of questions: “it depends on what you want to measure”. Continue reading

Observations represent the truth, models…

In the last year during a larger meeting I had made a comment, which let a lot of attendees shake their head and others just smile. The statement was:

“Observations represent the truth, models the state of our understanding.”

Like I have said before, on the first sight it is of cause rubbish that observations have anything to do with the truth. Indeed, truth is a great word with many different meanings and implications. In the context above “truth” (which anyhow should always set between quotation marks) describes the possible best estimation of the real world by the current available technology in real case situations. When I personally write things up, I usually use a measurement operator to make this clear that observations are never able to describe the full reality. How much effort observers might put at it (and they usually do an amazing job), the real physical state of a physical system can only be approximated. Continue reading

Drawing a line between models and observations

In my last post I showed that observations are models as well.  But when this is the case, why do we distinguish between these two kinds of data the way we do? Why is everyone so keen on observations, when they are just another model output?

The reason can be found usually in their different structure. The amount of modelling, which is applied to an observation to still be called observation should usually be very basic. Coming from the atmospheric sciences myself, the border between the two worlds can often be drawn in the type of the data. Generally the observations in that field are point data, often in situ data, which are irregular in time and space. In contrast to this, model data is usually very regular and sometimes high-dimensional.

Continue reading

All observations are models

Doing statistics between the two worlds of observations and model results lead often to the assumption that both are completely different things. There are the observations, where real people moved into the field, drilled, dug and measured and delivered the pure truth of the world we want to describe. In contrast to this, the clean laboratory of a computer, which takes all our knowledge and creates a virtual world. This world need not necessary have something to do with its real counterpart, but at least it delivers us nice information and visualisation. But this contrast between the dirty observations and the clean models is usually only something, which exists in our heads, in reality they are much more connected to each other.

Continue reading