Stop citing statistical indicators that you do not understand

In a previous blog post, I denounced the overuse and misuse of the GDP and CPI indicators.

Today, I condemn two additional economic indicators: manufacturing output indices and productivity indices.

My criticism is simple and harsh: these indices do not convey useful, objective information about the real world. There is no reasonable way to mash together numbers with different units, such as tons of steel forged, silicon chips fabricated, and automobiles assembled, in order to get a single number that conveys meaning. The process of doing so actually destroys information. If I had the raw numbers about steel or car production, that would tell me something about the world. But if I combine them into one number, what does that number mean? What does it tell me about the world? What sort of numerical alchemy produced the number?

If a number produced via a model really gives you useful information about the real world, then you should be able to reason out how a change in the real world will affect the number. For instance, an expert in baseball statistics could tell me how a pitcher's FIP or WAR or PECOTA would change in response to a given set of real world results.

So here is a set of questions for people who cite the manufacturing and productivity indices. Please cite official documentation when answering these questions:

1) At time A, the U.S. makes all of its CPUs internally. It makes 100 million CPUs a year. At time B, most of the CPU fabs have been shipped overseas. It now only makes 10 million CPUs a year. But these CPUs now have 20 times as many transistors, and operate at 2000 MHz instead of 500 MHz. How much has manufacturing output increased or decreased according to the government indices?

2) A bank replaces five bank tellers with an ATM. The ATM is built in China, using 100 man-years of labor. It will last for ten years. How much has labor productivity gone up or down in the United States, according to the official indices?

3) A school replaces four teachers with a computer program that automatically grades assignments. The school also adds two teachers due to a new special education law that requires the school to give extra help to the mentally challenged. These new special education teachers end up having no measurable impact on test scores. How much has the labor productivity in education changed? What if the new special education teachers increased test scores by 10%?

4) At time A, 25% of the cost of a Boeing airplane comes from importing parts made in Asia. Another 25% is U.S. labor costs, and the remaining 50% is materials. At time B, even more parts are made in Asia. The jet turbines are made in Asia, the electronics are made in Asia, the wheels are made in Asia, etc. But by cost, the same percentage of the plane is made in Asia. Thanks to new factories in China, the price of Asian goods has fallen, and thus twice as many parts can be imported for the same price. Has the manufacturing output of the United States gone up, gone down, or stayed the same?

I’m guessing you do not know the answer to these questions. I’m guessing that no one knows the answer to these questions. If you do know, please link me to the exact part of the methodology for these indices that shows the precise numerical calculation that could answer one of the above questions.

Note that in both scenarios 1) and 4), a reasonable person would say that these are instances of the U.S. economy hollowing out. Yes, in scenario 1) the total number of transistors being produced in the U.S. may have gone up. But the bottom line is that the U.S. went from making its own CPUs to importing its CPUs. America has gone from building stuff to importing stuff. Yet in these cases, I have no idea if that “hollowing out” would be captured in the manufacturing indices. And you do not know either. And neither do the people who cite these statistics and say, “Look, the economy is not hollowing out, these charts prove it.” Thus people who cite these amalgamated statistics are not contributing to the conversation; they are just adding noise.
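To make the arithmetic of scenario 1) concrete, here is a minimal sketch in Python. It uses only the figures given in the question; the transistor count per CPU at time A is normalised to 1 because the scenario gives only the ratio, and the variable names are illustrative. It is not a model of any official index. It only shows that the same real-world change reads as a 90% drop or a doubling, depending on which physical unit you choose to count.

```python
# Figures taken from scenario 1). Transistors per CPU at time A is normalised
# to 1 (an assumption: the scenario only states the 20x ratio).
cpus_a = 100_000_000          # CPUs made per year at time A
cpus_b = 10_000_000           # CPUs made per year at time B
transistors_per_cpu_a = 1     # normalised unit
transistors_per_cpu_b = 20 * transistors_per_cpu_a

# Ratio of time-B output to time-A output, counted in CPUs.
unit_ratio = cpus_b / cpus_a

# The same ratio, counted in transistors.
transistor_ratio = (cpus_b * transistors_per_cpu_b) / (cpus_a * transistors_per_cpu_a)

print(f"Output counted in CPUs:        {unit_ratio:.2f}x")
print(f"Output counted in transistors: {transistor_ratio:.2f}x")
```

Neither ratio says anything about the clock speed change, and neither tells you which one, if either, an amalgamated index would report.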

I also often come across long blog posts where people are trying to reason about why these numbers have gone up or why they have gone down. But if nobody knows how these numbers are calculated, if nobody knows how real world events would impact these numbers, then how on earth can we speculate about the meaning behind the numbers?

The entire process of figuring out how the world works is a process of refining complex data into simple, understandable models, and then building more complex models on top of those simple models. If you start from a model that nobody understands and that has no predictive ability, then you have just destroyed your ability to think. You are starting from nonsense.

And in fact, there is no objective way to calculate these numbers. These indices suffer from the same defects that afflict GDP and the CPI (see my criticism here). There is no mathematically sound way to combine multiple numbers that have different units. To do so is to turn mathematics into nonsense. Thus what actually happens is that the statisticians constructing these numbers invent various rationales for how to compare the dollar values of these goods over time, and then fudge and fix things until the numbers look somewhat plausible. The end result is equivalent to simply surveying a sample of government economists and asking them to estimate, “How much more productive is the United States today than it was 30 years ago?” This whole process just disguises a survey as being an objective calculation.
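The dependence on how dollar values are compared over time can be illustrated with a small sketch. It applies two standard textbook formulas, the Laspeyres quantity index (weights from period-A prices) and the Paasche quantity index (weights from period-B prices), to a two-good economy. Every quantity and price below is invented for illustration, and nothing here is claimed to be the methodology of any official index.

```python
# Hypothetical two-good economy. All figures are invented for illustration.
# Each entry: (quantity at A, price at A, quantity at B, price at B)
GOODS = {
    "steel_tons": (100.0, 500.0, 80.0, 700.0),
    "chips": (1000.0, 50.0, 3000.0, 10.0),
}

def laspeyres_quantity(goods):
    """Quantity index valuing both periods' output at period-A prices."""
    numerator = sum(q_b * p_a for (q_a, p_a, q_b, p_b) in goods.values())
    denominator = sum(q_a * p_a for (q_a, p_a, q_b, p_b) in goods.values())
    return numerator / denominator

def paasche_quantity(goods):
    """Quantity index valuing both periods' output at period-B prices."""
    numerator = sum(q_b * p_b for (q_a, p_a, q_b, p_b) in goods.values())
    denominator = sum(q_a * p_b for (q_a, p_a, q_b, p_b) in goods.values())
    return numerator / denominator

print(f"Laspeyres quantity index: {laspeyres_quantity(GOODS):.3f}")
print(f"Paasche quantity index:   {paasche_quantity(GOODS):.3f}")
```

With these made-up numbers the physical quantities are identical in both calculations, yet one formula reports output up 90% (1.900) and the other reports it up 7.5% (1.075). The only difference is which period's prices are used as weights.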

My plea going forward is that if you want to discuss manufacturing or productivity, talk about objective numbers that are comparable across time. Talk about the actual tons of steel forged in the U.S., or the actual number of automobile engine blocks produced.