Transparency of Quality is Essential for a Well-Functioning Health Care System

New York State CABG Mortality, with distribution, 1989-2011

A.  Introduction

Prospective patients will try to assess the quality of the medical care provided by the doctors or hospitals where they might go, when deciding where to seek treatment.  They seek out recommendations from friends and family, they look at publicly available rankings such as those of US News and World Report, and they have their own past experience with some doctor or hospital.  More recently, more information has become available on the internet, allowing prospective patients to look up personal histories on medical providers (where they went to medical school, their age, what languages they speak), as well as to view consumer comments and ratings on dedicated medical websites as well as websites such as Yelp.  There may also be reputational ratings (where doctors are asked what other doctors they would recommend), such as those conducted by the Washingtonian magazine in the Washington, DC, area.

But such information is limited, possibly biased, and superficial.  Recommendations of friends and family, your own experience, and comments and ratings on sites such as Yelp, are really just anecdotal, based on a very limited number of cases.  Individuals will also not always know whether the care they received was in fact high quality or not (there may have been complications, but they will normally not know if they were avoidable).  Rankings in reports such as that of US News and World Report have been criticized for being based on a small set of statistics (limited to those that the publication can obtain) which might have limited relevance.  And reputational ratings can be self-reinforcing, as those being surveyed rate some doctor or hospital highly simply because they have been highly rated in the past.  They may well have no real basis for making an assessment.

Most fundamentally, this information does not focus on what one really wants to know:  Does the doctor or hospital provide good quality care that will cure the patient?  Information such as that above has little on whether the doctors or hospitals are in fact any good at what they do.  Rather, the information is mostly on inputs (where did the doctor go to medical school, for example), or on superficial factors (was the receptionist pleasant when one checked in).

As a result, one can find out more on the quality of a $500 television that one is looking to buy, than on the quality of a doctor who will perform a coronary artery bypass surgery on you.

But information on actual results of doctors and hospitals, in terms of success rates (was the condition cured) and mortality rates, the frequency of medical complications, and other such measures, in fact exist.  The problem is that most of this information, with some exceptions noted below, is kept secret from the public.  Especially limited is information on the performance of specific doctors.  But the information is collected.  There are mandatory reports filed with government and regulatory authorities (both at the federal and state levels in the US).  Insurance companies (including Medicare) will know for the population they cover whether the treatment actually worked or required additional attempts or changes in approach.  Insurance will also know whether there were complications that then had to be treated (with the resulting expenses then filed).  And they will know all this at the level of the individual doctor and medical facility, and for the well defined specific medical procedures which were performed.

The information therefore exists.  The problem is that it is not made publicly available.  The normal rationale provided for this secrecy is that the information is complex and can be difficult to interpret by someone other than a medical professional.  But that is a lame excuse.  The information could be released in a form which adjusts for such factors as the underlying riskiness of the particular cases a doctor has dealt with (there are standard statistical ways to do this), and with accompanying information on the degree of uncertainty (derived statistically) in the information being provided.  One would also expect that if such information were made publicly available, then specialized firms would develop who would take such information and assess it.  Based on their technical analysis, they would sell their findings to insurance companies and firms, as well as interested individuals, on which doctors and facilities performed the best for specific medical procedures.  Government entities interested in good quality care (such as Medicare, in the public interest and also because good quality care costs less in the end) could also assess and make such information available, for free.

The real reason such information on outcomes is in general not made publicly available is rather that the results can be embarrassing for the doctors and hospitals.  And more than simply embarrassing, there could be huge financial implications as well.  Patients would avoid the doctors and hospitals who had poor medical outcomes.  With close to $3 trillion now being spent each year on medical care in the US, this means there are huge vested interests in keeping this information secret from the public.

This is starting to change, however.  As noted above, there are exceptions as well as experiments underway to provide such information to the public.  But it has been fragmented, partial, and highly limited.  The limited information that has been provided so far has been primarily at the level of hospitals, although there have been some experiments with data also being provided on the performance of individual doctors in certain specialties.

From these trials and experiments, we know that widespread availability of such information in an easily accessible form could have profound impacts on the practice of American medicine.

B.  The Impact of Transparency – A New York Experiment

The oldest and longest lasting experiment has been in New York.  Starting with data from 1989 (made publicly available in 1990), the New York State Public Health Commissioner has released the risk-adjusted 30-day in-hospital mortality rates of those undergoing coronary artery bypass graft (CABG, or simply heart bypass) surgery, by specific hospital.  They started to release physician specific mortality rates (on a three-year rolling basis) from December 1992.  There have been a number of good descriptions of, and analyses of the impacts of, the New York program.  Sources I have used include the articles here, here, here, and here.  In addition, a good description is provided as the third chapter in the excellent book by Dr. Marty Makary, Unaccountable, a source I will make further use of below.  Dr. Marty Makary is a physician at The Johns Hopkins Hospital, specializing in pancreatic surgery.  In addition to his many medical research publications, Dr. Makary has undertaken research on how to improve the quality of medical care delivery.

The chart at the top of this post shows what happened to 30-day in-hospital mortality rates following heart bypass surgery since 1989, across hospitals in New York State performing this procedure.  Only hospitals doing 70 or more such surgeries in any given year are included in the chart.  This was to reduce the statistical noise arising from small samples (and there were only a few exclusions:  two hospitals were excluded in two of the 23 years of data, and only one or zero in all of the other years).  A total of 28 hospitals were covered in the 1989 set, with the number rising over time to 38 in 2011.

The data were drawn from the annual reports issued by the New York State Department of Health.  Reports for 1994 to 2011 (the most recent report issued) are available on their web site.  Reports for earlier years were provided to me by a helpful staff member (whom I would like to thank), and the figures for the first half of 1989 were published in a December 1990 article in the Journal of the American Medical Association.  All the mortality rates shown are risk-adjusted rates, as estimated by the New York Department of Health, which controls for the relative riskiness of the patients (compared to the others in New York State that year) that were treated in the facility.

The chart depicts a remarkable improvement in mortality rates once it became known that the figures would be gathered and made publicly available, with individual hospitals named.  The chart shows the fall over time of the average rate across the state (note this is not the median rate, but rather the mean), as well as the minimum and maximum rates across all hospitals with 70 or more CABG procedures in the year.  The ranges at the 90th and 10th percentiles are also shown.  Among the points to note:

1)  The average risk-adjusted mortality rate fell sharply in the early years, and since then has continued to improve.  Furthermore, the underlying improvement was in fact greater than what it appears to be in these figures.  The average mortality rates shown in the chart are for the mix of patients (by riskiness of their health status) in each given year.  But especially in the early years, when angioplasty and coronary stent procedures were developing and found to be suitable for lower risk patients, the pool of patients for whom coronary bypass surgery was needed became a riskier mix.  Taking this into account, while the overall average mortality rate fell by a very significant 21% between 1989 and 1992, once one accounts for the higher risk of the patients operated on in 1992, the fall in the cross year risk-adjusted mortality rate was an even larger 41% over just this three year period.  Technology for CABG procedures did not change over this period.  Transparency did.

2)  The improvement in the coronary artery bypass surgery mortality rate in New York is especially impressive as New York was starting from a rate which was already in 1989 better than the average across all US states.  And by 1992, the rate in New York was the best across all US states.

3)  What is perhaps even more interesting and important, not only did the average rate in New York improve, but also the dispersion in mortality rates across hospitals was dramatically reduced.  The maximum (worst) mortality rate dropped from almost 18% in the first half of 1989 to under 6% by 1992.  The minimum rate was 2.1% in 1989H1, and fell to zero in 9 of the 12 most recent years.  One sees this narrowing in dispersion also in the range between the 90th and 10th percentile bands.

Publication of the mortality results got a good deal of media attention in the early years, and led to pressure, especially on the poor performers, to improve.  Note that the information being gathered was not anything new.  State health authorities long had reports on death rates by hospitals.  What was new was to make this information publicly available, with hospitals named.

Hospitals with poor records then scrambled to improve.  A range of actions were taken.  Some might have seemed obvious, but even so, were not undertaken until the mortality rates by hospital were made publicly available.  For example, hospitals with poor records began to create cardiac specific teams of nurses and other staff, rather than draw on staff from a pool who could be assigned to a wide range of different medical conditions.  Such specialization allowed them to learn better what was needed in cardiac surgery, and to work better as teams.  Such a reorganization at Winthrop Hospital, which included bringing in a new Chief of Cardiac Surgery who led the effort, led to a drop in its mortality rate from 9.2% in 1989 (close to the worst in the state in that year) to 4.6% in 1990 and to 2.3% in 1991 (better than the state wide average that year of 3.1%).

Other issues were highly hospital specific.  For example, one hospital (St. Peter’s in Albany) saw that its mortality rates for pre-scheduled elective and even urgent CABG surgery cases were similar to those elsewhere in New York.  But it had especially poor rates for emergency cases, which raised its overall average.  After reviewing the data, its doctors concluded that they were not stabilizing sufficiently the emergency patients before the surgery.   After it corrected this, its mortality rates fell sharply.  They were among the highest in New York in 1991 and 1992 (at 6.6% and 5.8%), but the rates then fell to 2.5% in 1993 and 1.4% in 1994 (when the New York average rate was 2.5%).  Mortality in emergency cases fell from 26% in 1992 (11 of 42 cases) to 0% in 1993 (zero in 54 emergency cases).

Another hospital (Strong Memorial) also found that its mortality rates for routine elective cases were similar to the New York average, but very high for the emergency cases, bringing up its overall average.  The problem was that while they had a good adult cardiac surgeon, he was always fully booked with routine cases, and hence was not available when an emergency case came in.  They then used one of two doctors who were not trained in adult cardiac surgery to handle the emergencies (one was a vascular surgeon, and the other a specialist in pediatric cardiac surgery).  By hiring a new adult cardiac surgeon and then better balancing the schedule, the rates soon dropped to normal.

American health care has traditionally relied on state regulators, armed with reports on hospital and indeed surgeon specific practices and outcomes, to impose safety and good practice measures.  But there is no way a central regulator can know all that might be underlying the causes of poor outcomes, or what actions should be taken to remedy the problem.  They also will not focus on hospitals with relatively good, or even average, mortality rates, even though such institutions could often still improve.  By releasing the data to the public, hospitals with poor records will be under great pressure to improve, while even those with relatively good records will see the need to get better if they are to stay competitive.  And the actions taken will often be actions that no central regulator would have been able to see, much less require.

C.  Staff Surveys as Another Indicator of Quality

Outcome indicators, up to and including mortality rates, are one set of measures which could have a profound impact on the quality of health care delivery if made publicly available.  An additional type of measure has been developed by Dr. Marty Makary, tested with a number of hospitals, and is now routinely used in hospitals across the US.  But the results are then typically kept secret from the public.

Specifically, Dr. Makary developed a simple staff survey (see here and here, in addition to his book Unaccountable referenced above) with some key questions.  The survey goes to all staff in a hospital, and asks questions such as whether the respondent would feel comfortable having their own care performed in the hospital unit in which they work.

In the original test, the surveys were sent to all staff at 60 hospitals across the US.  They got a 77% response rate, which is quite good.  What is most interesting was the wide range they found in the results across the hospitals.  For example, on the question of whether the staff member would want their own care performed at the hospital unit in which they work, there were two hospitals where close to 100% of the staff said they would, but also one hospital in which only 16% said they would.  There was a fairly even spread between these two extremes, and in about half of the hospitals surveyed, less than half of the staff said they would want their own care performed there:

Makary Hospital Staff Survey - Care in Own Unit.003

This would be powerful information to have as a patient.  The insiders are really the ones who know best what quality of care is being provided.  If even they would not want their health care needs met at their hospital, one knows where one would want to avoid.

It is recognized that the original Makary survey was done with the promise that the identities of the individual hospitals would not be revealed.  Should such surveys be made publicly available, the staff responding might well be less negative.  But the identities of the individual staff members would still be kept confidential (with the data gathered by an independent third party, and anonymously over the web).  There would certainly still be some dispersion in results across hospitals, and one could take into account the possible biases when judging the results.  And if a hospital is rated poorly by its staff even when they know the results will be made public, one knows which hospitals to avoid.  One would expect such hospitals then to scramble to improve the quality of the care they provide.

D.  While a Number of Transparency Initiatives Are Underway, They Remain Fragmented and Partial

Patients have always sought information on the quality of the care they will need, and have made decisions on where to go based on what they can find out.  But the information that they have been able to obtain has been only partial, highly fragmented, and far from what they really need to know to make a wise decision.

People will also find measures that are easily observed, but not necessarily terribly important to the quality of the care they will receive.  For example, they may find out whether parking is free and convenient, but this should not normally be a driving factor for their decision.  More relevant, and obviously something they will know, will be geographic location:  Is the facility close to them, or further away?  But they will normally have little basis for determining whether it is worthwhile to go a facility that is further away.

There has been a substantial expansion in recent years in the amount of information one can find on providers.  While still limited, one can find out more now than before.  There is the New York experiment described above, which New York soon extended from hospitals to individual surgeons, and also to angioplasty and cardiac stent procedures.  New York has also brought together on one web site easy access to a wide range of health topic data sets.  These include data sets on outcomes and quality of care indicators (such as the most recent CABG mortality rates by hospital and by surgeon, for example) but also many others (such as the most common baby names chosen).

The Obama administration has also expanded substantially the public availability of information on hospital quality measures.  The Centers for Medicare and Medicaid Services (CMS) now makes available at its Medicare Hospital Compare site results at the hospital level, drawn primarily from the data they have for Medicare patients, on such outcome measures as mortality rates, complications, hospital readmission rates, and other indicators.  However, they are still partial, and instead of showing, for example, actual and historical figures by hospital for indicators such as the rate of complications or mortality, they simply show whether the rates are similar to the national norm, or better or worse by a statistically significant margin (at the 95% significance level).

With the clear positive impact of the New York experiment, other states have also begun to implement similar programs.  But they remain partial and fragmented, and do not provide the comprehensive picture a patient really needs if they are to make a wise choice.

In addition, many professional medical societies have begun to collect similar data from their members, and then calculate risk-adjusted measures.  However, they have then kept the individual results secret, with identifying information by hospital or physician not made available.  Individual hospitals and physicians could release them if they so chose, and some have.  But one can safely assume that only those with good results will release the information, while those with poor results will not.

The same is true for hospital staff surveys, such as the one described above pioneered by Dr. Makary.  Such surveys are now widely used.  Dr. Makary reports in Unaccountable (published in 2012) that approximately 1,500 hospitals were then undertaking such surveys.  The number is certainly higher now.  But the results are in general kept secret.  Some hospitals make them publicly available, but one can again safely assume that these will be the ones with the better results.  Without the others for comparison, it is difficult to judge how meaningful the individual figures are.

So the relevant data are often collected already.  It is only a matter of making them public.  There is not a question of feasibility in collecting such data, but rather a question of willingness to make them public.

E.  What a Transparent System of Information on Quality Should Include

As noted above, people will gather what information they can.  But what they can gather now is limited.  What is needed is hard data on actual outcomes, identified by hospital and by individual doctor.  As the New York experiment discussed above indicates, the result could have a profound impact on quality of care.

Specifically, there should be easy access to the following specific measures:

a)  Volume:  While not directly an outcome measure, it is now well established in the literature that a higher frequency of a doctor undertaking some specific medical procedure, or that is done by all the doctors at some hospital or medical facility, is positively associated with better outcomes.  A doctor that undertakes a procedure a hundred times a year, or more, will on average have better outcomes than one who does the procedure only a dozen times a year (i.e. once a month).  And volume can be easily measured.  The problem is in obtaining easy access to the information, and at the relevant level of detail (i.e. by individual doctor, and for the procedure actually being considered for the patient, not just of some standard benchmark procedure).

b)  Success rates:  While many of the outcome measures being used in various trials and experiments are negative measures (mortality rates; complication rates), a more useful starting point would be risk-adjusted success rates.  What percentage of the procedures undertaken by the individual doctor or at the medical facility for some condition actually leads to a cure of the condition?  How success is defined will vary by the medical issue, but standard ones are available.  If the risk-adjusted success rate is 80% for one doctor and 99% for another, the choice should be clear.  Yet I have never seen a trial or experiment where such success rates by medical facility, much less at the level of individual doctors, were made publicly available.

c)  Success rates without complications:  A more stringent measure would be not only that the procedure was a success, but that it was achieved without a noteworthy complication such as an infection.

d)  Complication rates:  Moving to negative measures, one wants to see minimized the complications associated with some procedure.  The medical profession has identified the complications often found as a result of some medical procedure, and significant complications will be reported.  They can also normally be identified from medical insurance records, as they require treatment.  As with mortality rates, these should be published on a risk-adjusted basis.

e)  Mortality rates:  The ultimate “complication” is mortality.  As discussed extensively above, these should be made available by medical procedure and by individual doctor on a risk-adjusted basis.  The 30 day mortality might be appropriate for most medical procedures, but for others the 60 day or 90 day rates might be more appropriate.  Medical societies can work out what makes most sense for a given procedure.  But everyone should then be required to use the same measure, to allow comparability.

f)  Bounceback rates:  Bounceback rates are the percentage of patients undergoing some procedure, who then need to be readmitted back to a hospital (the original one or some other) within some period following release, usually 90 days.  Readmission rates are regularly collected by hospitals, and they can also be risk adjusted when made publicly available.  They are a good indication that some problem developed.  Some rate of readmission might well be expected for certain procedures.  They are not risk free.  But one wants to see if the bounceback rates are especially high, or low, for the physician or medical facility being considered.

g)  Never events:  Never events are events that should never occur.  While a certain rate of complications will normally be expected, one should never see an operation done on the wrong side of the body, or sponges or medical instruments left in the body after the surgeon has sewn up.  Hospitals know these and keep track of them (as such never events often lead to expensive lawsuits), but not surprisingly want to keep them secret.

h)  Hospital Staff Surveys:  As discussed above, Dr. Marty Makary developed a survey that would go to all hospital staff, which asks a series of questions on the quality of care being provided at the facility.  While approximately 1,500 hospitals were already administering the survey in 2012 (when his book Unaccountable was published), they are voluntary and in general not made publicly available.  They should be.

While the surveys can cover a long list of questions, Dr. Makary recommends (Unaccountable, page 216) that the percentage of hospital staff responding “yes” to the following three questions, at least, should be made public:

–  “Would you have your operation at the hospital in which you work?”

–  “Do you feel comfortable speaking up when you have a safety concern?”

–  “Does the teamwork here promote doing what’s right for the patient?”

F.  Conclusion

There are of course many other measures of quality one could examine, and there has been some movement in recent years to making more available.  These include results from patient surveys (“were you content with your experience at the hospital?”, “were the rooms kept clean?”), as well as the percentage of cases where certain established medical best practices were followed (“was aspirin given within 24 hours of a suspected heart attack?”).

Such additional measures might well be useful in particular cases.  It will depend on the individual, their particular condition, and what specifically is important to them.  People should have a choice, and do the research they personally wish to do.

But until hard measures on actual outcomes, such as those described above, are made widely available, and on a comprehensive rather than partial and fragmented basis, it will not be possible to make a well informed and wise choice on which doctor and medical facility to go to.  Without this, there can be no effective competition across providers.  There will be little pressure on the poor quality providers either to improve their performance, or drop out and let providers who can deliver better quality care treat the patients.

The impact on the quality of health care services provided would be profound.

More on the Widely Varying Charges for Common Health Procedures: Price Variation for Standard Blood Tests

Blood Test Prices in California - Lipid Panel

A.  Widely Varying Prices Charged Even for Standard Blood Tests

This post is an addition to an earlier post on this blog that looked at the widely varying prices being charged in the US for common health procedures.  As that post noted, such differences in prices for what are fundamentally the same services are a clear indication that the market is not working.  The prices would be similar if the market was working, with differences that are relatively small and explainable by factors such as geography.  But that is not the case.

That post looked at data from a number of studies (including my own simple research on the prices that I would be charged in the Washington, DC, area, for a common surgical procedure).  Prices could vary by a factor of 10, and indeed often even more.  And as that post showed in a series of charts, the prices actually paid in the US (at the rates negotiated by insurers) are not only widely varying, but also consistently far higher than the prices paid for the same procedures in other countries.

A criticism of studies that examine the prices being charged for health care procedures is that individual cases can differ, with some more complex than others.  Thus prices might vary for that reason.  Even though it is difficult to see how costs can vary by a factor of ten or more even with differing levels of complexity for some standard procedure (such as a hip or knee replacement, for example), one can recognize that differing degrees of complexity might explain at least some of the price differences.

Thus a study published last week in the BMJOpen, an open-access on-line journal affiliated with the British Medical Journal, is of interest as it addresses the question of whether such price variation is found also for procedures where case complexity does not enter.  The lead author is Dr. Renee Hsia, of the Department of Emergency Medicine at the University of California – San Francisco.  In an earlier study, summarized in the blog post cited above on health care price variation, Dr. Hsia had looked at the prices charged by hospitals in California for an uncomplicated but urgent appendectomy.  She found that the prices varied by a factor of 120, between the lowest rate charged and the highest.

In the current study, Dr. Hsia with her colleagues looked at the prices charged by California hospitals for ten common blood tests.  The prices reviewed are the so-called “chargemaster” rates, or the list prices at the hospitals for the tests.  The actual price paid will then normally be a lower rate negotiated with the hospital by your insurer (if you have insurance), but the chargemaster rate is the starting point.  Why this matters will be discussed below.

Dr. Hsia was able to obtain the data for California because hospitals there are required to report to state authorities the average prices they charged for a number of common procedures.  Since routine blood tests are standard, and are not more or less complicated for one patient vs. another (the blood is drawn, brought to a standard machine, and the results then read), one cannot argue that the price variation observed might be a consequence of different degrees of case complexity.

The results from one of the blood tests examined, that of a standard lipid test (which measures blood cholesterol levels), is shown graphically at the top of this post.  Data was available from 178 hospitals, and each hospital reported the average price it charged for this test over the course of 2011.  The price charged at one hospital was only $10 per test.  The average price charged at a different hospital, for the exact same blood test, was $10,169 per test, or over 1,000 times as much.  Such variation is absurd.

These are, of course, the extremes.  But even if one focusses on observations in the middle of the distribution, it is impossible to see how such variation in prices charged can be justified.  The price at the 5th percentile (meaning 5% of the hospitals charged this price or less) was $76.  The price charged at the 95th percentile (meaning 5% charged this price or more) was $602.  This is almost 8 times higher than the price at the 5th percentile.  The results for the other nine blood tests examined were broadly similar (with ratios between the prices at the 95th and 5th percentiles varying from a high of 12 times and a low of 6.8 times).

B.  Chargemaster Rates Matter

What can justify such a spread?  Nothing that I can see.  The tests are standard, use standard machines, and all use similarly drawn blood.  The response of a spokeswoman for the California Hospital Association was that the prices reviewed in the study are “meaningless”, since virtually no one (she states) pay these rates.  As noted above, the rates reviewed in the study, as in the earlier study of the prices charged for appendectomies, are the chargemaster rates of the hospitals.  These are the regular list prices for the procedures, which are then typically discounted in negotiations with individual insurers.

But there are still several problems with this, including:

1)  How much the prices are negotiated down will vary according to the bargaining strength of the patient’s individual insurer in the region.  In the bargaining process discussed in an earlier post in this series on health reform, insurers will bargain with hospitals on what the rates will be.  Their relative bargaining strength will depend on how concentrated the local market is in terms of hospitals (if there is only one hospital, or one chain of hospitals all owned by the same entity, but a number of insurers, the bargaining power of the hospital will be great) versus insurers (in one insurer dominates in the market, while there are many hospitals, that insurer will have great bargaining power).  If you have insurance with an insurer who does not command great market share in the region, the price you will have to pay may be close to the chargemaster rate.

2)  If you do not have insurance (and many could not get health insurance, prior to the reforms of Obamacare), you will be charged the chargemaster rate.  You might then try to bargain individually with the hospital, but the starting point will be the chargemaster rate.  And many hospitals will insist, unless you are poor, that you have to pay that chargemaster rate.

3)  You may well have insurance, but if the particular hospital you are in is not in your insurance network (perhaps because you were brought by an ambulance to the nearest hospital in an emergency), you will be charged the chargemaster rate.  Your insurance company might pay a portion of this at what they consider to be a “reasonable rate”, but this is likely to be close to what your insurer has negotiated with others, and as we have discussed in the earlier blog posts cited above, this might be only one-tenth of the chargemaster rate.  You will then still be responsible for the other 90%.  This can be a lot, if you are at the hospital where a simple lipid panel blood test is charged at over $10,000.

4)  You may well again have insurance, and be in a facility that is in-network for your insurer, but your insurer might disagree on whether some standard blood or other test ordered by your doctor was really needed.  Your insurer will then refuse to cover the cost of that test, and you will be charged the chargemaster rate.

I am personally facing a case of that right now.  While the amounts are small in absolute terms, the issue is the same.  My doctor ordered a set of routine blood tests for me earlier this year, and my insurer covered all except one.  For that one, the insurer asserted that there had not been a need (even though both my doctor, and research I found on the web, made clear that the test was in fact needed).  The lab therefore charged me the full chargemaster rate (which in this case was $213.98), even though the negotiated rate Aetna would have paid, had they agreed it should be covered, was only $16.23.  That is, the full billed rate was 13.2 times the negotiated rate.  I would have been glad to pay the negotiated rate in full, and the $16.23 the lab has negotiated with Aetna is evidently a rate sufficient to provide an adequate profit to the lab.  But find it absurd that I should have to pay over 13 times more.  I am appealing, but do not know yet the outcome.

5)  Finally, it is worth noting that the chargemaster rates can matter for other issues as well. For example, hospitals are typically required to provide a certain amount of “charity care” (care provided to the poor without health insurance for free or at discounted rates) in order to benefit from certain tax breaks.  This is especially important and valuable for private, profit-making, hospitals.  Valuing such services at the chargemaster rates, when these rates are 1000 times higher than what someone else might charge, will make it look as if the hospital is providing a good deal of charity care.

C.  Conclusion

This new study should put to rest the argument that price variation in health care services is principally due to variation in the degree of complexity of individual cases.  Common blood tests are standard, and they show price variation which is huge as well as similar in degree to that seen for standard health care procedures (see the review in the earlier post).  The prices vary not principally due to case complexity, but rather due to a grossly misfunctioning market for health care services, where there are strong forces keeping out effective competition and any pressure to converge on low prices from efficient providers.

America’s Underinvestment in Public Infrastructure

Real per Capita Public Investment vs. GDP, 1950-2013

Public infrastructure in the United States is an embarrassment.  This is clear even to ordinary travelers.  Countries in Europe and in much of East Asia enjoy far better roads, highways, public transit, and other forms of public infrastructure than the US does, even though the real per capita incomes of these countries are lower than that of the US.  And this is backed up by more systematic global comparisons, such as in the Global Competitiveness Report of the World Economic Forum.  The most recent report ranked the US as only number 15 in the world in terms of its infrastructure (transport, power, and telecom).  This put the US behind Canada, the major countries of Western Europe, and such countries as Japan, Korea, Hong Kong, Singapore, and Taiwan in East Asia.

The poor quality of public infrastructure in the US should not, however, be a surprise.  As the chart at the top of this post shows, the US is spending no more now, in real per capita terms, than it did over a half century ago in 1960, in the last year of the Eisenhower administration.  The chart draws on data issued in the standard GDP (NIPA) accounts of the BEA of the US Department of Commerce.  Infrastructure investment is taken to be total government investment (at all levels of government – Federal, State, and Local) in structures, excluding such spending by the military.  Most government infrastructure spending in the US is for transport (primarily roads and associated bridges, but also including investment in mass transit, ports, and airports), with a significant amount also for water and wastewater treatment.

Public infrastructure spending in real per capita terms rose during the Eisenhower administration in the 1950s (when the Interstate Highway system was started) and continued rising during the Kennedy and Johnson administrations in the 1960s.  Indeed, during this period, such spending rose at a somewhat faster pace than real per capita GDP, the blue line in the chart.  But starting in 1969, the year Nixon took office, public infrastructure spending was cut.  By the mid-1970s it was down close to the level seen at the end of the Eisenhower administration (in real per capita terms), and then was cut even further at the start of the Reagan administration.  It then began to increase from 1984 with this continuing to a peak in 2002, after which it fell again.  By 2013 it was 2% lower than it was in 1960.  Over this same period, real per capita GDP almost tripled.

In dollar terms, real per capita spending on public infrastructure (in terms of 2009 prices, the base now used in the GDP accounts) was $793 in 1960 and was 2% lower, at $776, in 2013 (about 1.6% of GDP).  Over this same period, per capita real GDP rose from $17,159 in 1960 to $49,852 in 2013.  The increment in real per capita GDP was $32,693 over this period.  None of this growth went to increased investment in public infrastructure.

It is this stagnation in real per capita spending, and huge lag behind income growth, that has led to bridges and highways that are both congested and in poor condition.  People drive more, fly more, and import and export more goods, as their real incomes grow.  Public infrastructure has not kept up.  A 2009 report issued by the American Association of State Highway and Transportation Officials (AASTO) notes that vehicle miles driven between 1990 and 2007 rose by 41%, about double the increase in the US population over this 17-year period (of 20.6%).  Based on the figures in the chart above (which however covers all public infrastructure, not just highways), spending to build or maintain such infrastructure per mile driven fell by over 20% over that period.

The AASTO report also found (based on an analysis of US Federal Highway Administration data) that one-third (33%) of the nation’s major highways was classified as being only in poor or mediocre condition (as of 2007).   Thirteen percent was classified to be in poor condition, with this rising to over 60% poor in some major urban areas.   And roads in poor or mediocre condition deteriorate quickly, leading to much higher costs when the road eventually has to be repaired.  The AASTO report notes that the cost per mile over 25 years is three times higher if roads are left to be reconstructed, instead of maintained on the regular recommended schedule.

This stagnation in real per capita spending on public infrastructure over more than a half century may be surprising to some.  While many might be aware that infrastructure spending has not kept up with real per capita GDP (which has almost tripled), most people would assume that there has been at least some increase in per capita infrastructure spending.  But that is not the case.

Part of the reason for this mis-conception is that when measured as a share of GDP, it might not appear that public infrastructure spending has fallen so far behind.  As a share of GDP, public infrastructure spending (using the figures cited above for public investment in non-Defense structures, from the BEA accounts) was 39% less in 2013 than it was in 1960.  Put another way, public infrastructure spending would have had to increase by 64% (=1/(1-.39)) between 1960 and 2013, to match the GDP share it had in 1960.  But the figures shown above in the chart indicate that public infrastructure spending would have had to triple over this period to match the increase in GDP.

Why this big difference?  The reason is Baumol’s Cost Disease, which was discussed in an earlier post on this blog.  If the price index for public infrastructure spending over this period had matched the price index for overall GDP, then an increase in infrastructure spending of 64% would suffice to bring it into line with the increase in real GDP over the period.  But the cost of building infrastructure has risen at a faster pace than the cost of making goods generally.  This is not because of increased waste, but rather because building infrastructure is by nature labor intensive and hard to automate.  The relative cost of infrastructure will therefore increase over time relative to the cost of goods whose production can be increasingly automated.

The importance of this is huge, but is often ignored in the debates.  As the chart above shows, investment in public infrastructure has stagnated in real per capita terms over more than a half century, and would need to almost triple at this point to catch up with how much real per capita GDP has grown.  This is far greater than the 64% increase (which is itself not small) that one might assume would be necessary by simply focussing on GDP shares.

The fundamental cause of this stagnation in real spending on public infrastructure has been an unwillingness in Congress to pay for it.  The most important source of funding for highway expenditures has been the gasoline tax, which supports the Highway Trust Fund. But as was discussed in an earlier post on this blog, gasoline taxes have been set as so many cents per gallon and are not adjusted regularly for inflation.  The last time the tax was raised (in nominal terms) was in 1993, over 20 years ago.  Since then, even general inflation has eroded this by over 50%.  If one took into account that prices for infrastructure investments rise at a substantially faster pace than general prices (due to Baumol’s Cost Disease, discussed above), the real erosion has been much greater.  As a result, funds in the Highway Trust Fund are far from adequate.

The result has been repeated crises as the Congress passes one short term patch after another to allow even the overly low on-going highway investments to continue.  One such crisis is underway now, where expenditures would need to be slashed on August 1 if nothing is done.  The Senate is currently expected to vote this week on an extension, although it would only be for a few months at best.  If passed and can then be reconciled with a similar House passed measure (passed two weeks ago), spending on highway investment will be able to continue for a few more months.

To provide the needed funds, given that the Highway Trust Fund is far from sufficient (due to the failure to adjust the tax to reflect inflation), Congress has included again an especially stupid provision in the draft bills.  As it did in an earlier authorization in 2012 (see the blog post cited above), Congress would allow corporations to make assumptions on their pension obligations which will in effect allow them to underfund their pension obligations by even more than currently.  The corporations will then show (on their balance sheets) higher profits, which will generate somewhat higher corporate income tax obligations.  These higher tax obligations will be counted as government revenues.  But those reliant on corporate pensions will be at greater risk of not receiving the pensions they are owed.  Ultimately the government may be obliged to cover these pension obligations (through the Pension Benefit Guarantee Corporation).  But these costs latter costs are being ignored.