I’m beginning to suspect that the notion of entropy outside the physical domain is simply a mathematical construct and may have nothing really to do with “information” as the word is generally accepted (unless you change the numeraire and define information as entropy). I claim that we can work with it as a mathematical construct (like the notion of variance), but to try and assign a deeper meaning or a connection to information may be going too far. (This does not preclude me from claiming that the mathematical construct called entropy is increasing).
The size of a zipped-file containing the text of Eco’s Foucault’s Pendulum does not represent the information contained within that text. The notion of information is relative to the reader and is at least tangentially related to the concept of “meaning.”
Shannon’s entropy is a measure only of the probability of a specific message amongst all possible messages. It is not a function of the transmitter or the receiver, only the channel. But, for someone who speaks no Korean, the Korean translation of Foucault’s Pendulum is meaningless, even though the entropy is probably the same. So entropy does not represent meaning or information or the lack of meaning or uncertainty or any such thing except in a narrowly defined sense as described by Shannon.
Here is a quote from Shannon’s original paper:
The fundamental problem of communication is that of reproducing at one point either exactly or approximately a message selected at another point. Frequently the messages have meaning; that is, they refer to or are correlated according to some system with certain physical or conceptual entities. These semantic aspects of communication are irrelevant to the engineering problem. The significant aspect is that the actual message is one selected from a set of possible messages
I repeat for emphasis:“These semantic aspects of communication are irrelevant to the engineering problem.” When we talk about information, all we’re talking about is the semantic aspect, not the engineering aspect. When a new discovery is made and it enhances our understanding of the world it increases the information content in the universe, but it is not directly related Shannon’s entropy. It does increase the size of our libraries, but so do many other things that are not considered new forms of information, such as a book containing a purely random stream of numbers that just hasn’t been recorded before. Entropy offers us no way to distinguish between the two and hence may not be useful as a direct measure of information.
How does this relate to my earlier thesis about information, entropy, and kurtosis? I’m not sure yet, but I’m thinking about it. I think it only requires a change in terminology to remain consistent. That is, I can remove all references to “information” and work only with the math of entropy and still have the essay be meaningful.

