The Revisions to Gross Domestic Product, Gross Domestic Income, and Their Major Components

Now more than ever, the private sector and policymakers rely on the National Income and Product Accounts (NIPAs) to provide a timely, comprehensive, and accurate picture of the condition of the U.S. economy to inform economic decisions.

The two featured measures in this study, gross domestic product (GDP) and gross domestic income (GDI), are the two most used measures to assess a country's economic pulse, and that is why, throughout this study, we systematically analyze GDP and GDI data releases in search of possible ways to improve the methods by which we aggregate and report data at different frequencies and in different sectors of the economy.

GDP measures activity as the sum of all final expenditures in the economy; it is detailed on the product side of the domestic income and product account. Meanwhile, GDI measures activity as the sum of all incomes generated in production; it is detailed on the income side of the account. By design, the flows of final expenditures and incomes generated are equal, but in practice, they differ because each is estimated from largely different source data. The difference between GDP and GDI is allocated to the income side of the domestic income and product account and is known as the statistical discrepancy.

Measuring the accuracy of national account estimates is challenging because it is impossible to know the true values of the estimates. This measurement challenge is long-standing and derives from three principal facts about the compilation of the estimates. First, the earliest GDP and GDI estimates are based on partial and preliminary data and on trend projections when data are not available. Second, the source data for the early estimates in the NIPAs come from a mixture of survey, tax, and other business and administrative data as well as various indicators, such as heating degree days; these data are subject to a mix of sampling and nonsampling errors and biases that cannot be assessed. Third, the NIPAs are regularly revised to reflect changes in the economic concepts and methods necessary for the accounts to provide an accurate picture of the evolving U.S. economy.1 For example, in the 2013 comprehensive revision of the NIPAs, a new category of investment—intellectual property products (IPP)—was formed; it consists of research and development; entertainment, literary, and artistic originals; and software. This last component, software, was also measured before, but was bundled with equipment in a category called equipment and software. In addition, seasonal adjustment factors were updated to reflect changing seasonal patterns. Given the measurement challenges, accuracy cannot be assessed by conventional statistical measures, such as standard errors. Instead, the Bureau of Economic Analysis (BEA) focuses on the reliability of the estimates, which can be assessed by examining the sizes and patterns of revisions to estimates (see the box “Accuracy, Reliability, and Uncertainty”).

Because the revisions are repeated estimations of the economic activity in a period, their pattern and magnitude provide a measure of the reliability of the estimates. Some revisions are due to the replacements of early extrapolations for missing source data, preliminary survey data, or the replacement of indirect indicators with revised and more complete data, such as annual data and census data. However, some of the revisions are the result of updates to the concepts on which the accounts are based, as with the 2013 comprehensive revision mentioned above.

BEA's principal standard of reliability is based on the revisions from its early estimates to its “latest” estimates, most of which have been through at least one comprehensive revision (see the box “Vintages and Timing of Revisions” for the detailed revision schedule ).

Because the latest vintage incorporates the most recent comprehensive revision, it incorporates all the available source data that are believed to be the most reliable.

More specifically, BEA judges the qualitative reliability of its early estimates by whether they present the same general picture of economic activity as the latest estimates in terms of the following: 1) long-term growth rates; 2) trends in saving, investment, government spending, corporate profits, and other key components of GDP and GDI; 3) broad features of the business cycle, including the timing and depth of recessions, the strength of recoveries, and the major components contributing to growth and contractions; and 4) the patterns of quarterly growth, including whether growth in any period is high or low relative to the trend, is accelerating or decelerating, or is positive or negative.

As we will show in the following sections, both the pattern and the magnitude of the revisions indicate that the early estimates are reliable. That is, the revisions do not substantively change BEA's measures of long-term growth, the picture of business cycles, and the trends in major components of GDP and GDI. In short, the early estimates provide a consistent and reliable picture of general economic activity. Given the general accuracy of the early estimates, policymakers should be able to rely on these estimates as accurate measures of economic activity.

Quantitatively, revisions are measured as the changes from an earlier vintage of a given estimate to a later vintage of that estimate, for example, from the advance estimate to the latest estimate. There are three early vintages of “current quarterly” estimates of GDP—the advance estimates (released a month after the quarter ends), the second estimates (released 2 months after the quarter ends), and the third estimates (released 3 months after the quarter ends). Each estimate is produced using a wide mix of source data—preliminary survey results, such as the Census Bureau's Survey of Retail Sales and Services, and manufacturers' shipments, various indicators, trade industry data and more—that are later revised to reflect more complete information or to replace trend projections.

BEA examines qualitative measures of reliability that focus on the overall performance of the three vintages of the current quarterly estimates of growth, relative to the latest quarterly estimates of growth. More specifically, the qualitative measures address two questions about current-dollar GDP and current-dollar GDI.

  • Does the sign (+/−) of all the current quarterly vintages of estimates of growth match that of the latest estimate? This is a measure of directional reliability. For GDP, the signs match 95 percent of the time, and for GDI, 92 percent of the time.
  • Do the magnitudes of all the current quarterly estimates fall above, within, or below one standard deviation of the mean growth of the latest estimate over the period? This is a measure of reliability relative to trend. For 1996–2018, the latest estimates of GDP had a mean of 4.42 percent and a standard deviation of 2.72, and the current quarterly vintages fell within one standard deviation of the mean (between1.71 and 7.15) about 78 percent of the time. For GDI, the corresponding mean is 4.50 percent and the standard deviation is 3.11, and the current quarterly vintage fell between one standard deviation of the mean (between 1.38 and 7.61), also about 78 percent of the time.

It is important to note that the frequency counts were based on a comparison between all the current quarterly vintages available for a quarter and the latest quarterly vintage for that quarter. For GDP, there are always three current quarter estimates to compare, the advance, second, and third, but for GDI, from 1996 to the end of 2003, the available current quarter estimates are only second (expect for fourth quarters) and third, and from 2004 on, the fourth estimate is also available, and therefore included in our comparisons with the latest estimate.

As a preview of the analysis, chart 1 provides an illustration of this point by plotting advance estimates of real GDP growth over 22 years, from 1996 to 2018, against the latest estimates.2

Chart 1 shows that, with few exceptions such as the Great Recession period and a couple more quarters throughout the sample, the overall pattern of movement in GDP is little changed by revisions. The Uncertainty Around Turning Points section will provide a more detailed analysis of the estimates' revisions around the two recessions included in the sample.

A similar picture emerges for the third estimate of GDI versus the latest estimates (chart 2).

The remainder of this article is organized as follows:

In table 1, the MARs to the annual rates of change of both current-dollar and real quarterly GDP growth—the average of the revisions without regard to sign from the current quarterly estimates to the latest estimates—range from 1.23 percentage points to 1.36 percentage points for 1996–2018. The MRs—the average allowing both positive and negative values for the revisions–range between −0.12 percentage point to 0.17 percentage point. MRs indicate whether the revisions are generally positive or negative. Because revisions may be offsetting, the MRs are much smaller than the MARs (see the box “Mean Revisions, Mean Absolute Revisions, and Standard Deviations”).

The MARs among the current quarterly estimates are smaller than the MARs from the current quarterly estimates to the latest estimates. For example, in table 2, the MAR from the advance estimates to the second estimates of real GDP is 0.50 percentage point, the MAR from the advance estimates to the third estimates is 0.63 percentage point, and the MAR from the second estimates to the third estimates is 0.28 percentage point.

In table B in the appendix, one can see, for example, that the MR for real GDP from the advance estimates to the latest estimates is just −0.01 percentage point, a tiny revision. The MRs from the second and third estimates to the latest estimates are −0.09 and −0.10 percentage point respectively, still rather small. To put these MRs into context for 1996–2018, the mean growth of real GDP (latest estimates) over this period was 2.50 percent, and the growth rates ranged from −8.38 percent to 7.53 percent.

Mean absolute revisions

Table 1 presents revisions for real and current-dollar GDP and their components. For all three current quarterly vintages for 1996–2018, the MARs for both current-dollar GDP and real GDP are somewhat larger than 1.20 percentage points. The MARs for current–dollar GDP are slightly larger than the MARs for real GDP for all current quarterly vintages. For current–dollar measures, the MARs decline slightly but steadily from the advance to the third estimates, while for real measures, the MARs stay constant and then decrease slightly from the advance to the third estimates. The decline in MARs with successive vintages is expected, as BEA receives and incorporates more and better source data with each vintage.

The MARs for current-dollar personal consumption expenditures (PCE) are about 0.30 percentage point smaller than those for the corresponding vintages of current-dollar GDP; the MARs are 1.03 percentage point for the advance estimates, 0.92 percentage point for the second estimate and 0.90 percentage point for the third estimate. Those for real PCE are slightly smaller at 0.99 percentage point, 0.90 percentage point, and 0.88 percentage point for the advance, second, and third vintages, respectively. MARs for both current-dollar and real measures tend to decline slightly with successive vintages.

Within PCE, the MARs for durable goods are noticeably larger, about 3.93 percentage points for current-dollar durable goods and 4.14 percentage points for real durable goods. Also notable is that durable goods is the only category within PCE in which the MARs are larger for the real measure than for the current-dollar measure.

The MARs for nondurable goods are larger than those for all PCE and tend to decline with successive vintages of estimates. The MARs for the advance estimate is 2.12 percentage points, and those for the second and third estimates are close to 2.00 percentage points. The MARs for the real estimates are approximately 0.35 percentage point smaller than those for current-dollar estimates.

The MARs for services are the smallest of all PCE components shown. The MARs for current-dollar services are 0.95 percentage points for the advance, 0.92 for the second estimates, and 0.82 percentage point for the third estimate. The MARs for the corresponding real estimates are slightly smaller, but the differences are less than 0.10 percentage point. Moreover, the MARs for services decline with successive vintages.

The MARs for gross private domestic investment are relatively large; they range from around 4.40 percentage points to 4.92 percentage points for both current-dollar and real investment. Both increase by approximately half a percentage point from the advance estimate to the second estimates, then decrease by a quarter of a percentage point with the third estimate. The MARs for fixed investment are smaller; for current dollars and real estimates, the advance estimates are approximately 2.85 percentage points and the second and third estimates decrease by approximately 0.3 and 0.2 percentage point, respectively. The smaller sizes are the result of excluding inventory investment, which in a previous study using an alternative methodology, was found to have larger revisions (see Fixler, Greenaway-McGreevey, and Grimm (2011).

Within fixed investment, MARs for current-dollar nonresidential range from 3.17 percentage points to 3.53 percentage points, while real nonresidential MARs increase from 3.11 percentage points to 3.44 percentage points from the advance estimate to the third estimate. Additionally, MARs for current-dollar structures decrease from 8.58 percentage points for the advance estimates to 7.94 percentage points for the second estimates and 7.57 percent for the third estimates. MARs for real structures are slightly lower, decreasing from 7.74 percentage points for the advance to 7.07 percentage points for the second estimates and 6.54 percentage points for the third estimate.

In the category equipment and intellectual property products, MARs for the current-dollar revisions range from 3.30 percentage points to 3.69 percentage points over the three vintages and decrease from the advance estimate to the third estimate. MARs for the real estimates are slightly higher and range from 4.06 percentage points to 4.54 percentage points, again decreasing from the advance estimate to the third estimate.

The MARs for residential investment are in general approximately within the same range as those for equipment and intellectual property products investment. These estimates decline across successive vintages for both current-dollar estimates and real estimates.

The MARs for current-dollar estimates of exports and imports for the three current quarterly vintages range from 2.76 percentage points to 3.68 percentage points. The MARs for the corresponding real estimates tend to be slightly smaller, with very small differences for each vintage.

The MARs for government consumption expenditures and gross investment hover around 1.60 percentage points for all vintages. The MARs for federal expenditures for current-dollar estimates and real estimates are higher. They range from 2.98 to 3.17 percentage points; these estimates are relatively constant across all three vintages. The MARs for defense range from 4.16 percentage points to 4.20 percentage points in current dollars and from 4.04 percentage points to 4.10 percentage points in real dollars. Finally, the MARs for nondefense spending increases slightly across the three vintages for real estimates from 3.64 percentage to 3.87 percentage points, and current-dollar estimates hover around 4.20 percentage points. Finally, the MARs for state and local government expenditures for the current-dollar advance estimate is 1.80 percentage points and around 1.63 percentage points for both the second and third estimates. The MARs for the real estimates decrease across vintages, from 1.58 percentage points for the advance estimate to 1.42 percentage points for the third estimate.

Tables 2 and 3 present expanded information about the MARs of real and current-dollar GDP, including MARs for all vintages of estimates to any later estimate.

Table 2 shows that the MARs for the various vintages of current-dollar estimates all increase with later vintages. Specifically, the increase in MARs is largest from earlier current quarterly estimates to the first annual estimate. The increase in MARs tapers off following the first annual estimate. The reason for the large jump in MARs from earlier estimates to the first annual estimate is that comprehensive revisions of the NIPAs are made in 5-year intervals; as a result, the first annual revision estimates contain the redefinitions and reclassifications about one-fifth of the time; the second annual revision estimates, about two-fifths of the time; and the third annual revision estimates, about three-fifths of the time. Thus, the MARs for the successive annual revision estimates increasingly reflect the effects of these changes as well as the incorporation of annual data that become available with 1- to 3-year lags.

These same patterns generally hold true for the major components and subcomponents of current-dollar GDP. The MARs for PCE tend to be modestly lower than those for GDP. The MARs for durables PCE are the largest among the subcomponents of total PCE. In fact, they are several times larger than the MARs for total PCE. MARs for nondurables PCE are almost twice the MARs of PCE but smaller than the MARs for durable goods. The MARs for services PCE is comparable to that of total PCE.

The MARs for gross private domestic investment are substantially larger than the MARs for all other components of GDP and about 5 times as large as the MARs for current-dollar GDP. The MARs for fixed investment are about half of the corresponding MARs for gross private domestic investment; the differences reflect the exclusion of change in private inventories, which are subject to large unmeasurable (in percent changes) revisions. Within fixed nonresidential investment, the MARs for nonresidential structures investment is about twice as much as the MARs for equipment and intellectual property investment. For residential investment, the size of the MARs are comparable to those for nonresidential investment. However, the MARs for residential investment from the earlier vintages to the first annual estimates and beyond are larger than those for nonresidential investment.

For exports and imports, the patterns of MARs are roughly the same and similar to the patterns for the other components of GDP, with the MARs of the advance estimate to later vintages of imports estimates being just slightly larger than those for exports.

The patterns of MARs for government consumption expenditures and gross investment are also similar to those for GDP, though slightly larger. The patterns of MARs for the components and subcomponents of government are roughly similar to the aggregate but with slightly larger values, particularly for federal nondefense, and the first two estimates of state and local government. The later MARs estimates for state and local government are smaller than the MARs of other government components and more similar in size to the MARs for the government consumption expenditures and gross investment aggregate. Relatively little new information comes in for the later estimates, and the sizes of the revisions reflect this fact.

In summary, there are several characteristics of the pattern of revisions that hold true generally but not universally. The difference in revisions from the second to the third vintages of estimates are typically the smallest for most adjacent pair of vintages. For any given vintage, the MARs tend to increase as later and later vintages include more comprehensive revisions. For any vintage of later estimates, the MARs tend to decrease in size as later and later vintages of estimates are used as the earlier estimates that have already incorporated a lot of information. That is, for any row (vintage of earlier estimates) in table 2, moving across columns of later and later estimates results in increasingly large MARs. Also, for any given column (vintage of later estimates), going down rows generally results in decreasing MARs.

Table 3 presents MARs for real GDP and its components. Both the sizes and the patterns of the MARs are close to those of MARs for current-dollar GDP. The similarities reflect the fact that revisions to deflators used to construct the real estimates are quite small in comparison to the current-dollar revisions, and most estimates of the components of GDP are made in current dollars and then deflated.

Table 4 provides additional information about the revisions from earlier vintages of estimates to the latest estimates. Columns 1 and 2 show the standard deviations of the latest estimates of real and current-dollar GDP and its components and subcomponents at the same level of detail as in table 1 and table 2. These are measures of the volatility of the estimates. They range from 1.47 percentage points for real PCE services to 15.48 percentage points for current-dollar structures investment.

Columns 3 and 4 of table 4 expand upon columns 5 and 6 of table 1 by presenting standard deviations of the three annual revision vintages of estimates. Again, the standard deviations show wide variation in size, ranging from 0.70 percentage point for current-dollar PCE and 0.61 percentage point for real PCE to 10.32 percentage points for current-dollar structures investment and 9.78 percentage points for real structures investment. The standard deviations may be scaled by dividing the standard deviations of the revisions by the standard deviation of the estimates in order to facilitate comparisons among components. When the standard deviations of revisions are scaled, the resulting standard deviations for each component and subcomponent have relatively small ranges. from 0.13 percentage point (imports) to 0.99 percentage point (federal nondefense) for current-dollar GDP and 0.14 percentage point (residential investment) to 1.23 percentage points (state and local) for real GDP. Thus, the more volatile components of GDP (those with larger standard deviation of estimates) also tend to have more variable revisions (larger standard deviation of revisions).

Mean revisions

In addition to MARs, table 1 presents MRs for real and current-dollar GDP. The reported MRs are generally much smaller than the MARs because positive and negative revisions tend to offset each other, and MARs are functions of both the MRs and the variances of the estimates, which by definition are positive. For most of the measures shown in table 1, there is little or no tendency for MRs to become smaller with successive vintages of estimates. Most of the MRs for GDP and its components reflect definitional changes that are part of comprehensive revisions made to improve the estimates, and these revisions have usually raised the rates of change of GDP and some components.

The MRs for both current-dollar and real GDP are generally negative. The MR for the advance estimates of current-dollar GDP is 0.17 percentage point, and the MR for real GDP is −0.01 percentage point. The MRs for the second and third current-dollar estimates of GDP are 0.05 percentage point and −0.03 percentage point, respectively, and the corresponding MRs for real GDP are −0.06 percentage point and −0.12 percentage point, respectively. The MRs for the components and subcomponents of real and current-dollar GDP are generally small, except for the MRs of current-dollar structures investment, which are substantially larger. The signs of the MRs vary across subcomponents and often vary with successive vintages.

Table 1 also presents the standard deviations of revisions. They are somewhat larger than the MARs for GDP and its components. The pattern observed for the MARs discussed in the previous section holds true in general for the standard deviations as well. For example, within PCE, durables have the largest standard deviations while services have the smallest. Also, the largest standard deviations are observed for gross private domestic investment.

The standard deviations in table 1 can be used to test whether the MRs are statistically significant.3 The revisions that underlie the reported MRs appear to be normally distributed, and the MRs are not statistically different than zero for GDP or any of its components or subcomponents. This is true for current-dollar as well as real GDP.

Tables 5 and 6 offer supplemental pictures of the revisions. Table 5 focuses on the advance to latest estimate of real GDP and its major components, providing additional information about the distribution of the revisions. In table 5, the number and proportion of observations that fall within the 1 standard deviation range of the MRs are reported. The revisions appear to hover around the mean and the proportions approximate what one would observe for normally distributed random variables.

In the 2013 comprehensive revision of the NIPAs, a new category of investment called intellectual property products (IPP) was added; this category consists of research and development, entertainment, literary, artistic originals, and software. This last component, software, was also measured before but was bundled with equipment in a category called equipment and software.

Charts 3 and 4 make the case for creating a new category on its own, instead of expanding equipment and software to equipment and IPP. For example, chart 3 compares the latest estimates of growth of nominal equipment and software and IPP since the series of IPP is publicly available, and by looking at the chart one can see that the growth rates of both types of investment are quite different both in their levels and in their cyclical properties.

This is not surprising since investment in equipment is usually expensive and lumpy, while investment in IPP is not, and the depreciation rates of both types of investment are also significantly different.4

Moreover, chart 4 also shows that the quarterly revisions between the advance estimate of each series and its latest growth estimates, although similar in magnitude, are quite different through time and don't follow similar patterns.

Now we turn our focus on the properties of the IPP growth estimates that BEA has published since 2013.

Charts 5 and 6 show the evolution of the real and nominal growth estimates of IPP, from the advance quarterly estimates to the latest ones available.

Chart 5 shows that even though the early estimates of real and nominal growth, as well as the latest estimates, are positively correlated, there is no systematic pattern between the advance and the latest series.5 Sometimes advance estimates of growth are above final estimates, but some other times they are below. And similarly, real and nominal growth estimates cross multiples times throughout the sample, showing no systematic pattern between the real and nominal series either.

Lastly, chart 6 shows the differences between the latest and advance growth estimates for the real and nominal IPP quarterly growth. As one can see, the differences in the revisions from latest to advance are very similar for both the nominal and real estimates, (with maybe a noticeable gap between the two series at the end of 2015 and the beginning of 2016, when there was a strong real growth in IPP), with a much more modest growth in nominal terms, indicative of falling prices at least for some of the components inside the IPP category.

Table 7 presents the MRs and MARs of the revisions to quarterly estimates of GDI. Note that some categories of GDI are revised more than others, like national income and corporate profits, whose first estimates are released at the same time as the third revisions of the other components of GDI. Consequently, advance estimates of GDI are not prepared, and since 1995, second current quarterly estimates of GDI in the fourth quarter of each year have not been prepared either. The reason that these estimates of GDI are not produced is mainly because there is a lack of reliable source data for estimating corporate profits, which affects several other components of GDI. Advance and second estimates are prepared for the other major components of GDI, such as compensation of employees and proprietors' income.

Beginning with the first quarter of 2002, a fourth vintage of the estimates for the previous quarter has been prepared using data from the Bureau of Labor Statistics (BLS) QCEW. These data are used to revise the estimates of compensation of employees, national income, and GDI. The fourth vintage is released at the time of the second quarterly estimate of the following quarter. However, the fourth vintage MR and MAR numbers are not shown in table 7.

The MAR for the early annual estimates of GDI (to the latest estimate) is 1.62 percentage points. It increases with the first annual estimates and then declines with the second and third annual estimates. This is comparable to the MAR for real GDP (table 3), which shows an increase from the advance estimates to the third estimates before declining for the subsequent estimates. The MARs for most of the components of GDI tend to decline with successive vintages of estimates, with a few exceptions: private consumption of fixed capital, proprietors' income, and nonfarm proprietors' income. Another notable observation from table 7 is that the MARs for GDI components can be significantly larger than the MAR for total GDI. For example, the MARs for corporate profits is about 10 times larger than those for GDI, and the MARs for private consumption of fixed capital is about 5 times larger than those for GDI.

The MRs for GDI and its major components are much smaller than the corresponding MARs, and the MRs for any given component can have a mixture of positive and negative signs for different vintages. Private consumption of fixed capital and corporate profits have only negative MRs for all vintages; in contrast, taxes on production and imports, compensation of employees, proprietors' income, and nonfarm proprietors' income have only positive MRs for all vintages. Additionally, like the GDP shown in table 1, the MRs do not tend to decline with successive vintages of estimates.

Table 8 presents additional information on the standard deviation measures of GDI estimates. The scaled standard deviations provide a sense of how volatile the revisions can be, taking into account the variability of the estimates themselves. The scaled standard deviations for GDI are rather small, all less than 1 percentage point for all vintages. For all other components of GDI, the scaled standard deviations are similar to those of aggregate GDI. However, in stark contrast to all the other components, the scaled standard deviations of private consumption of fixed capital are above 6 percentage points for all vintages, and the standard deviations gets larger with each successive vintage.

Table 9 presents the full MAR triangle for GDI and its major components. Note that a fourth vintage is presented for compensation, national income, and total GDI that incorporates data from the QCEW. However, the fourth statistics should be viewed with caution because no fourth estimates were prepared before the first quarter of 2002, and the statistics are not fully compatible with the rest of the entries, which are for 1996–2018. Thus, the MARs for the fourth estimates are not discussed.

The patterns of MARs for successive vintages of estimates and vintages of standards in table 9 for GDI are, for the most part, similar to those for GDP. For example, the MARs for GDI from the third estimates to the third annual estimates decline overall, but the MAR of the first annual estimate doesn't follow this pattern. The increases in MARs for the various vintages of GDI estimates are not monotonic when later vintages of estimates are used as standards. Lastly, the whole set of MARs for GDI tends to be modestly larger than the comparable ones for GDP.

Of all the components of GDI, the MARs reported for compensation of employees appear to be the smallest, with noticeably sharp drops in MARs moving from the third estimate to the fourth estimate, regardless of which vintage is used as the standard. MARs for national income are similar to those of compensation of employees; they also show that sharp drops in MARs occur from the third estimate to the fourth estimate, again regardless of which vintage is used as the standard. Even though the MARs for national income and compensation of employees are relatively small compared to the other GDI components, they are still larger than the MARs reported for aggregate GDI.

The patterns for GDI and its components are generally similar to the patterns for current-dollar GDP, but there are deviations; that is, the MARs for GDI and its components sometimes decrease from vintage to vintage used as the standards (rows) or increase from vintage to vintage of the estimates for any given standard (columns). Nevertheless, there is a tendency toward increases across rows of table 9 and decreases down the columns. In general, the MARs for the various components and vintages tend to be larger than those for the various GDP components.

Table 10 and table 11 present supplemental information about the revisions for GDI. Table 10 focuses on the third to latest estimate of GDI and its major components and provides additional information about the distribution of the revisions. The revisions appear to hover around the mean and the proportion of revisions within one standard deviation from the mean approximate to what one would observe for normally distributed random variables.

In the United States, the National Bureau of Economic Research (NBER) Business Cycle Dating Committee maintains a chronology of U.S. business cycles. NBER emphasizes that a recession involves a significant decline in economic activity that is spread across the economy and lasts more than a few months. In practical terms, NBER uses three criteria—depth, diffusion, and duration—to determine the onset, length, and end of a recession, and these criteria are at least somewhat interchangeable.

In choosing the dates of business cycle turning points, NBER follows real GDP as the single best measure of aggregate economic activity, paying close attention to the two ways BEA measures this concept, from the product side (real GDP) and from the income side (real GDI). Additionally, NBER also considers total payroll employment as measured by BLS as well as other timelier economic indicators.

Lastly, and to avoid the need for major revisions, NBER waits to determine the date of a peak in activity, and thus the onset of recession, until confident that, even in the event that activity began to rise again immediately, a recession has occurred. Although this delay is understandable, a timely public policy and decision making are key to mitigate the effects of recessions, and that is why there are numerous models in different public institutions estimating the probability of being in a recession.6

Analogously, the same process takes place determining the end of a recession, so effectively there is a delay of a number of months identifying the peaks and troughs of recessions.

Our sample period from 1996 to 2018 covers two of the last recessions registered by NBER, the first one started in March 2001 (first quarter), and finished in November of the same year (fourth quarter), and the second one was the Great Recession, that officially started in December 2007 (fourth quarter) and ended in June 2009 (second quarter).

Galvão and Kara (2020) find that data revisions to U.K. GDP lead to a turning point chronology that has fewer and shorter recessions than real-time chronologies that rely on early estimates of GDP.

As mentioned before, in the United States, NBER determines the length of recessions with a very significant delay, as seen in table 12, so the chronology of recessions is never revised, but we can see ex-post how our knowledge of a recession changed with time and data.

For example, by comparing the early and the latest vintages of quarterly growth produced by BEA as they were being released during recessions, we can explore whether the growth rates from beginning to end of a recession vary by vintages and how our knowledge of a recession was changing at the time of the recession. Of special interest would be to see if in line with the U.K. findings of shorter recessions, we find that recessions in the United States also turn out to be shorter or milder when we use more recent vintages containing more data and information.

BEA publishes early estimates of quarterly GDP growth the month after the quarter has passed so early estimates of a first, second, third, and fourth quarter are published in April, July, and October of a given year, and January of the following year. These are the estimates in the left panel of chart 7.

The second estimates come a month after the early estimates, and the third ones, a month after the second ones, so for example, by the time the early estimate of third quarter is released in October, the third estimate of the second quarter is already available (as it was released in September), and so on.

Other papers, like Jordà et al. (2020) try to predict the size of the GDP growth revisions from the advance to the second and third revisions, using complementary economic indicators and economic theory, to improve economic policy making in real time, especially during recessions.

Although their approach is very valuable, in this paper we take a different turn and focus exclusively on BEA-released data to analyze when we knew what in times of economic turbulence.

Thus, the right panel of chart 8 shows the third estimates during the Great Recession.

The differences between the two panels in chart 7 are visible, but small, in the sense that they didn't significantly change our picture of what was happening at the time. For example, by looking at the third revisions, the peak or turning point from the previous expansion didn't change from the early to the third revision and continued to be the fourth quarter of 2007, although the growth rates leading to this quarter had been stronger than we originally thought. Similarly, the growth in the second quarter of 2008 was stronger than anticipated, but the last quarter of 2008 and first quarter of 2009 were both very negative and similar in magnitude; although the first quarter of 2009 was slightly less negative than the last quarter of 2008, it was too close to affirm that the worst had passed already and we had begun to turn a corner.

BEA also conducts annual revisions (three in total) of quarterly growth every July; that is, the first annual revisions of all quarter growth estimates in a year are released in July of the following year. Moreover, the first major revision of a first-quarter growth estimate in a year is also released in July of that same year, together with the first annual revisions from the previous year's quarters mentioned above.

In chart 8 below, the left panel shows the first major revision of the first quarters and the first annual revisions of the second, third, and fourth quarters around the Great Recession. The panel on the right shows the latest growth estimates available, which include not only all the data available many years later, but also comprehensive revisions to how we measure GDP.

The first major annual revisions of the last quarter of 2008 and the first quarter of 2009 shown in the left panel were released at the end of July 2009, timidly reversing the idea that the worst had happened in the last quarter of 2008, and instead hinting that the economy had continued worsening during the first quarter of 2009. The first annual revision also highlighted for the first time that economic activity during the third quarter of 2008 had contracted significantly.

Subsequent annual revisions of the quarters around the Great Recession didn't change the picture on the left panel of chart 8, so with this and other economic data, NBER dated the peak and trough around the Great Recession as December 2007, at the end of the fourth quarter of that year, and June 2009, at the end of the second quarter. However, after the later comprehensive revisions shown in the right panel of chart 8, the length of the Great Recession was shorter than what earlier estimates signaled, in line with what Galvão and Kara (2020) found for the United Kingdom, but also deeper than what we previously believed.

Next, we conduct a similar graphical analysis of the data released during and after the “high tech” bust of 2001 in charts 9 and 10, showing data from 2000 Q4 to 2002 Q1. The peak and trough of this recession seems to have changed when comparing the four panels of these charts, with only the latest estimates of quarterly growth shown in the right panel of chart 10 coinciding with the turning points dates announced by NBER and shown in table 12.

However, when looking at the depth of the recession with the luxury of time, the latest estimates of the high tech bust show a deeper fall in economic activity than what we believed was going on at the time of the recession, just as charts 7 and 8 showed for the Great Recession.

Table 13 shows summary statistics for revisions of the different annual estimates to the latest estimates for both real and current-dollar GDP and their major components. Like the quarterly estimates, the MRs for the annual estimates are small in magnitude and vary in direction. The early annual estimates are those that become available in March of the following year and are mostly composed of third current quarterly estimates, excluding the fourth-quarter numbers, and thus their MRs are similar to those of previous quarterly estimates.

The MARs are considerably smaller than the current quarterly estimates for both real and current-dollar GDP. One important reason is that annual frequency estimates are not subject to revisions to seasonal adjustment factors, and the other, of course, is that they aggregate activity during a longer period of time where misses in one quarter are going to be captured in another quarter, or simply compensated for by opposite misses in another. The MARs show a tendency to decline over each vintage both in current-dollar and real-dollar estimates. Moreover, MARs also decline steadily across vintages for the underlying components. As with total GDP, the MARs of components are much smaller than those for quarterly estimates.

Revisions in definitions, as part of comprehensive NIPA revisions, have generally resulted in increases to the growth rates of GDP. For example, the introduction of software as investment in the 1999 comprehensive revision raised both the levels and the growth rates of GDP. However, the introduction of research and development (R&D) and artistic originals as investment in the 2013 comprehensive revision raised the levels of GDP but did little to change the growth rates. The reason is that once R&D and artistic originals were included in the early estimates of 2013 in the new investment category of IPP, together with software, posterior revisions of this investment were incorporated into the years prior to 2013, in order to acknowledge that these products existed before and had been growing in importance before they were introduced in the national accounts. See the section on IPP above where we discuss in detail the introduction of the IPP category, separate from investment in equipment. Lastly, for a broader discussion about the effects of comprehensive revisions on the estimates, see Fixler, Greenaway-McGreevey, and Grimm (2014).

Table 14 presents summary statistics for revisions of annual estimates of GDI growth rates and its major components. The estimates are expressed in terms of percentage points of the annual estimates. As with the quarterly estimates, the MRs are small in magnitude and vary in sign. The early annual estimates are similar to the third current quarterly estimates, and their MRs are therefore similar to those of previous estimates of the same vintage.

The MRs for GDI and its components are a fraction of the size of those for the current quarterly estimates. The same pattern is observed when comparing annual estimates of GDP with the quarterly estimates, and the same reasoning applies: annual estimates are not subject to revisions based on changes to seasonal adjustment factors. The MARs for GDI tend to decline with each successive estimate. These declines reflect the incorporation of newly received annual source data that is more complete or revised from earlier data. In particular, the second annual vintage is the first to incorporate data from the Statistics of Income database from the Internal Revenue Service (IRS), so the drop in MARs should be most noticeable moving from the first annual estimate to the second annual estimate for GDI and its components. For example, the MAR for GDI declines 0.23 percentage point from the first annual estimate to second annual estimate and then increases slightly to the third annual estimate. The MAR for net interest is notable, dropping 4.16 percentage points from the first annual estimate to the second annual estimate.

In national economic accounting, GDP and GDI are conceptually equal, but practically different. GDP measures overall economic activity by final expenditures, and GDI measures economic activity by the incomes generated from producing the goods and services inside GDP. In practice, GDP and GDI differ because they are constructed using different sources of information, both imperfect in different ways. The different source data produce different results for a number of reasons, including sampling errors, coverage differences, and timing differences with respect to when expenditures and incomes are recorded.7

Charts 11 and 12 show that for most of our sample, estimates of real GDP and GDI growth give a similar picture of the economy.

In the end, GDP and GDI are two ways of measuring aggregate economic activity, so while some of BEA's data users prefer one over the other, many users like to have both sets of information since they complement each other. Thus, in response to this demand from its data users, in July 2015 BEA started publishing a new series that averages GDP and GDI, providing users another way to track U.S. economic growth.

Table 15 presents the MARs and MRs of the current-dollar average of GDP and GDI series from the third estimate onwards compared with subsequent annual revisions and the latest available data.8 Because of the delay in the release of GDI estimates compared to GDP estimates, and in order to conduct a fair comparison across the vintages, the sample period used in the table runs from 1996 to 2018.9

The top half of the table reports the MARs of the average series and the stand-alone ones, and as expected, the numbers increase as later and later vintages are used as the standard. The MAR (and the corresponding MR) appear to be smallest going from the second annual estimate to the third annual estimate because by the second annual estimate, most of the information has been incorporated into the estimates already, and there are very little trend-based estimates and very little use of indirect sources. Moreover, most direct indicators have also been replaced by comprehensive data, so the change in data moving from the second annual revision to the third annual revision is minimal and involves only some additional conversion of direct indicators to comprehensive data.

The data shows that the MARs of the average series are smaller than the MARs for GDI for all vintages and smaller or equal to most of the MARs for GDP. If both GDP and GDI are interpreted as the sums of the unobserved true economic activity and of measurement errors, it is possible to infer that the weighted average series of the two is a more reliable measure of activity than either GDP or GDI alone. This occurs, of course, because some of the measurement errors are averaged out, reducing subsequent revisions in the weighted average. However, the better reliability observed using the average of GDP and GDI will not lead to a better understanding of the detailed workings of the economy because there is no obvious way of distributing the results of the averaging among the major components of GDP and GDI. Thus, the average series can only provide supplemental summary information about the recent course of the aggregate economy.

The bottom half of the table reports the MRs of the series of interest, and as one can see that for the third revision, MRs increase when the first annual estimate up to the third annual estimate are used as standards, but for the latest available estimate, the MR declines. Specifically, the MR for the average of GDP and GDI is −0.04 percentage point for the third vintage when the first annual revision is used as standard, and it increases to −0.14 percentage point when the second annual revision is used as standard instead, and increases a bit more when the third annual revision is standard. The MR then falls to −0.02 percentage point when comparing the latest estimate to the third annual estimate.

For the second annual vintage, the MR increases when the latest estimate is used as the standard rather than the third annual (from −0.07 percentage point to 0.12 percentage point). The same patterns are observed if we look at current-dollar GDP and GDI series on their own (just reported below in table 15 as well). The pattern may be explained by the fact that on average, for both GDP and GDI (and therefore, the average of GDP and GDI), the revisions are negative as later vintages are used as the standard except when moving from the third annual estimate to the latest available estimate. The explanation for this is that the revision produced by the latest available estimate less the third annual estimate includes comprehensive revisions, and on average this tends to be a relatively large positive number, thus reversing some of the negative revisions observed for earlier vintage comparisons.

BEA seeks to provide a timely, comprehensive, and accurate picture of the U.S. economy. However, because of the flow of source data over time, BEA provides vintages of estimates. This study examines how those vintage estimates vary. Throughout this study, we have shown that the different estimates of GDP and GDI that BEA publishes are reliable and that the differences in vintages estimates arise mostly by new information and not by measurement errors. More specifically, we have shown that that the estimates of GDP and GDI have small MARs and MRs for most quarters in our sample, consistent with previous similar BEA studies.

The average of GDP and GDI series appears to be a more reliable measure of aggregate economic activity than GDP and GDI taken separately in the sense that the MARs of the average series are smaller or equal compared to those of GDP and GDI taken individually. Moreover, when we take the MRs from the latest to the third estimates for GDP and GDI individually and compare the sign of these revisions side by side in every quarter in our sample, we find that in more than a third of cases, the latest revisions of GDP and GDI moved in opposite directions, that is, by looking at the third revisions, we were underestimating GDP and overestimating GDI or overestimating GDP and underestimating GDI. In these cases, combining these estimates is particularly useful to give us a more accurate picture of the economy, and it clearly shows that the data sources we use to compute these two estimates are different and convey information that is not always captured “properly” or in a timely manner in the other estimate, indicating that the estimates of GDP and GDI are complements.

However, the average series can only be constructed in the aggregate and with a delay because the results of the average cannot be distributed among the major components of GDP and GDI. A by-product of our analysis is a demonstration of the uncertainty surrounding the BEA estimates. We have shown that revision magnitudes can be used to construct confidence intervals. We have also shown that uncertainty surrounding estimates of turning points does not impact the qualitative nature of the analysis.

 


  1. For example, the accounts contain no entries for business investment in computer software before 1959; the amount of software prior to that year was negligible. But by 2012, business investment in software was 1.7 percent of the size of GDP; the level and rates of growth of the economy were raised when software investment was first included in the accounts in the late 1990s.
  2. The time period for this study is 1996–2018. A study of the most immediate revisions by Fixler, Kanal, and Tien (2018) covered data from 1993–2015. The years 1993–1996 from the last study do not exhibit any trends that affect the current study, so the sample years have been rolled over 3 years toward the present to make comparisons consistent with previous studies.
  3. Given a normal distribution, the standard deviation is a component of the statistic that tests the hypotheses that the MR is significantly different from zero.
  4. See comprehensive estimates of depreciation rates by type of assets in the paper “BEA Depreciation Rates.
  5. The correlation between real and nominal IPP makes sense because nominal growth includes real growth (quantity) and price growth.
  6. See, for example, Pike and Vazquez-Grande (2019) or Ergungor (2016).
  7. For a detailed discussion of the timing of GDP and GDI estimates and the source data available for each estimate, see Fixler, Greenaway-McGreevey, and Grimm (2014) and Holdren (2014).
  8. GDI is not available when the advance estimates of GDP are released, and it is only available for the first three quarters of the year when it is released with the second estimate of GDP. Therefore, for a comparable and continuous quarterly average series, the third estimate would be the earliest vintage.

 

    Ergungor, O.E. 2016. “Recession Probabilities,” Federal Reserve Bank of Cleveland, Economic Commentary 2016–09 (August).

    Fixler, D.J., D. Kanal, and P-L.Tien, “The Revisions to GDP, GDI, and Their Major Components,” Survey of Current Business 98 (January 2018).

    Fixler, D.J., R. Greenaway-McGreevey, and B.T. Grimm, “The Revisions to GDP, GDI, and Their Major Components,” Survey of Current Business 94 (August 2014).

    Fixler, D.J., R. Greenaway-McGreevey, and B.T. Grimm, “The Revisions to GDP, GDI, and Their Major Components,” Survey of Current Business 91 (July 2011).

    Galvão, A.B., and A. Kara. 2020. “The impact of GDP data revisions on identifying and predicting UK Recessions,” UK National Institute of Economic and Social Research, Economics Statistics Centre of Excellence Discussion Paper 2020–12 (July).

    Holdren, A.E., “Gross Domestic Product and Gross Domestic Income: Revisions and Source Data,” Survey of Current Business 94 (June 2014).

    Jordà, O., N. Kouchekinia, C. Merrill, and T. Sekhposyan. 2020. “The Fog of Numbers,” Federal Reserve Bank of San Francisco Economic Letter 2020–20 (July).

    Kelly, P.A, S.H. McCulla, and D.B. Wasshausen, “Improved Estimates of the National Income and Product Accounts: Results of the 2018 Comprehensive Update,” Survey of Current Business 98 (September 2018).

    Kuznets, S. 1948. “National Income: A New Version,” Review of Economics and Statistics 30 (August): 151–179.

    Manski, C. F. 2015. “Communicating Uncertainty in Official Economic Statistics: An Appraisal Fifty Years after Morgenstern,” Journal of Economic Literature 53 (September): 631–653.

    Pike, T. and F. Vazquez-Grande. 2019. “Out-of-Sample Performance of Recession Probability Models,” FEDS Notes 2019-12-13-1, Board of Governors of the U.S. Federal Reserve System.