Is the Taxable Income Elasticity Su¢ cient to Calculate Deadweight Loss? The Implications of Evasion and Avoidance Raj Chetty UC-Berkeley and NBER August 2008 Abstract Since Feldstein (1999), the most widely used method of calculating the excess burden of income taxation is to estimate the e¤ect of tax rates on reported taxable income. Feldstein’ taxable income formula for deadweight loss implicitly assumes that the marginal s social cost of evasion and avoidance equals the tax rate. This paper argues that this condition is likely to be violated in practice for two reasons. First, some of the costs of evasion and avoidance are transfers to other agents in the economy rather than real resource costs. Second, some individuals overestimate the costs of evasion and avoidance. I show that, in such situations, excess burden depends on a weighted average of the taxable income and total earned income elasticities, with the weight determined by the resource cost of sheltering income from taxation. This generalized formula implies that the e¢ ciency cost of taxing high income individuals is not necessarily large despite evidence that their reported incomes are highly sensitive to marginal tax rates. Keywords: excess burden, tax evasion, optimal taxation E-mail: chetty@econ.berkeley.edu. I have bene…ted from discussions with Alan Auerbach, Peter Diamond, Caroline Hoxby, John Friedman, David Gamage, Roger Gordon, Louis Kaplow, Adam Looney, Wojtek Kopczuk, Emmanuel Saez, Joel Slemrod, Shlomo Yitzhaki, and Philippe Wingender. Gregory Bruich provided outstanding research assistance. Funding from the Hoover Institution is gratefully acknowledged. In an in‡ uential pair of papers, Martin S. Feldstein (1995, 1999) showed that the excess burden of income taxation can be calculated by estimating the e¤ect of taxation on reported taxable income – the “taxable income elasticity.” Feldstein’ taxable income approach has since become the central s focus of the literature on taxation and labor supply because of its elegance and practicality. The approach is elegant because one does not have to account for the various channels through which taxation might a¤ect behavior (e.g. hours, e¤ort, training) to measure e¢ ciency costs. It is practical because tax records containing data on reported taxable income are widely available. The empirical literature on the taxable income elasticity has generally found that elasticities are large (0.5 to 1.5) for individuals in the top percentile of the income distribution, and relatively small (0 to 0.3) for the rest of the income distribution (see e.g., Lawrence B. Lindsey 1987, Joel B. Slemrod 1998, Jonathan Gruber and Emmanuel Saez 2002, Saez 2004). This …nding has led some to suggest that reducing top marginal tax rates would generate substantial e¢ ciency gains.1 The taxable income reported by high income individuals is very sensitive to the tax rate partly because of tax avoidance and evasion (Slemrod 1992, 1995).2 For example, individuals make charitable contributions to reduce their taxable income or use unmonitored o¤shore accounts to under-report income. Does the e¢ ciency cost of taxation depend on whether the taxable income elasticity is driven by avoidance and evasion rather than changes in labor supply? Existing studies (e.g. Feldstein 1999, Slemrod and Shlomo Yitzhaki 2002, Saez 2004) suggest that the answer is no, as long as there are no changes in tax revenue from other tax bases. For example, Slemrod and Yitzhaki remark that “Feldstein’ (1999) claim about the central importance of the elasticity s of taxable income generalizes to avoidance and evasion.” The intuition underlying this conclusion is straightforward: an optimizing agent equates the marginal cost of sheltering $1 of income from taxation with the net marginal cost of reducing earnings by $1, so the reason that reported taxable income falls does not matter for e¢ ciency calculations. This paper reevaluates the taxable income elasticity as a measure of deadweight loss in the presence of evasion and avoidance (“sheltering”behaviors).3 Feldstein’ formula implicitly requires s that the marginal social cost of sheltering $1 of income equals the tax rate (the bene…t of sheltering Academic examples include Gruber and Saez (2002) and Feldstein (2006). The Joint Economic Committee (2001) has argued in favor of lowering top rates based on the taxable income evidence. See Austan Goolsbee (1999) for a critique of the empirical literature on taxable income. 2 Income shifting can also occur intertemporally. When tax changes are anticipated, individuals appear to retime income substantially (Goolsbee 2000). I abstract from such intertemporal e¤ects, focusing on the question of how to measure e¢ ciency costs using estimates of the long-run e¤ect of taxes on behavior. 3 The distinction between illegal evasion and legal avoidance is not critical for the analysis in this paper, so I use the term “sheltering” as a general description of all evasion and avoidance behaviors. 1 1 $1). This condition is likely to be violated in practice for two reasons. First, and most importantly, some of the costs of sheltering are transfers to other agents in the economy rather than real resource costs. For instance, an individual may be deterred from tax evasion because of the expected cost of being …ned by the government or other agents in the private sector. Individuals seeking to avoid taxation by making charitable contributions or setting up trusts for their descendants may not fully internalize the bene…ts associated with their contributions, e¤ectively incurring a transfer cost for sheltering income. Second, empirical studies have found that individuals overestimate the costs of sheltering –e.g. overestimating the detection probability and …nes for tax evasion (James Andreoni, Brian Erard, and Jonathan Feinstein 1998). Such optimization errors also create a di¤erence between the true marginal social cost of sheltering and the tax rate. The taxable income formula for deadweight loss does not hold when the marginal resource cost of sheltering di¤ers from the tax rate. Indeed, if sheltering has no resource costs, it generates no e¢ ciency loss at all because it simply leads to a reallocation of resources across agents. In this case, deadweight loss depends purely on the total earned income elasticity –the e¤ect of taxes on “real” choices that a¤ect total earnings. In the general case where sheltering has a positive resource cost that is not necessarily equal to the tax rate, I derive a simple formula for marginal excess burden that depends on a weighted average of the reported taxable income and total earned income elasticities. The weight is proportional to the loss in social surplus from of an additional dollar of sheltering –the marginal resource cost. Intuitively, reductions in total earnings caused by taxes always generate excess burden because they distort aggregate output. The additional excess burden generated by sheltering behaviors –which lead to a di¤erence between earned and taxable income –is proportional to the marginal resource cost of such behaviors. The formula developed here applies irrespective of whether transfer costs or optimization failures create the wedge between the marginal resource cost and the tax rate. The formula is also una¤ected by revenue o¤sets (transfers to the government) that occur through shifting of income across tax bases. In this sense, the analysis generalizes Feldstein (1999) by providing a robust method of calculating marginal excess burden when the perceived private cost of sheltering di¤ers from its social cost. The results in this paper have several precedents in the literature, notably in the work of Slemrod and co-authors. It is widely recognized that the calculation of excess burden is complicated by revenue o¤sets in the presence of multiple taxes (e.g. Slemrod 1998, Roger H. Gordon and Slemrod 2 2002, Slemrod and Yitzhaki 2002, Alan J. Auerbach and James R. Hines Jr. 2002, Saez 2004). Slemrod (1998) and Saez (2004) propose formulas that adjust for revenue o¤sets by adding terms for the change in revenue from other taxes. In addition, Slemrod (1995) and Slemrod and Yitzhaki (2002) observe that …nes lead to a di¤erence between the private and social costs of evasion. They note that this di¤erence will create an added term that must be taken into account when calculating excess burden, but do not characterize that term formally. This paper contributes to the taxable income elasticity literature in three ways. First, it shows how transfers between agents within the private sector a¤ect the calculation of excess burden. Existing formulas that adjust for revenue o¤sets are not valid in the presence of private transfers. Second, unlike earlier formulas, the formula derived here accommodates optimization errors in sheltering. Third, even ignoring within-private-sector transfers and optimization errors, the formula o¤ers an alternative approach to measuring the marginal excess burden of taxation in the presence of revenue o¤sets. This alternative representation permits calculation of the excess burden of an income tax change without characterizing its e¤ects on other tax bases, as would be required to implement the Slemrod and Saez formulas. It also yields some new intuition into the key determinants of excess burden. For instance, in the extreme case of pure transfer costs, the analysis shows that sheltering has zero e¢ ciency cost when the added term mentioned by Slemrod and Yitzhaki is taken into account, completely severing the link between the taxable income elasticity and excess burden. The results in this paper point to the marginal resource costs of avoidance and evasion as key parameters to be estimated in empirical studies of taxation. Since the resource costs of sheltering could potentially be much smaller than top marginal tax rates, one cannot conclude that the e¢ ciency cost of taxing high income individuals is large directly from existing evidence of large taxable income elasticities. I Theoretical Analysis This section presents formulas for the marginal excess burden of a linear income tax under various assumptions about the costs of sheltering. As a reference, I …rst derive Feldstein’ (1999) formula s in a model without sheltering. Section I.B considers a model where individuals can avoid or evade taxes by paying a real resource cost. In section I.C, I analyze a model where sheltering has no resource cost but requires a transfer to another agent. Section I.D presents a general formula for marginal excess burden when sheltering has both resource and transfer costs. 3 In section I.E, I show that this general formula is una¤ected by optimization errors in sheltering decisions. Finally, section I.F considers the implications of the e¢ ciency analysis for optimal taxation. To simplify the exposition, I abstract from income e¤ects by assuming quasilinear utility in the main text. In Appendix A, I analyze a model with curved utility and show that the same formula is obtained with the uncompensated elasticities replaced by compensated elasticities. I.A Benchmark Model: No Sheltering Consider the canonical static labor-leisure model, where an individual chooses how many hours to work (l) at a …xed wage rate w. Let t denote the tax rate on labor income, y unearned income, c consumption, and (l) the disutility of labor. Let R(t) = twl denote tax revenue. The individual’ s problem is to (1) max u(c; l) = c l (l) t)wl s.t. c = y + (1 As is standard in excess burden calculations, the conceptual experiment I consider is to measure the net dollar-value loss from raising the tax rate and returning the revenue lump-sum to the taxpayer. For this purpose, I de…ne social welfare as the sum of the individual’ utility (which is a money s metric given quasilinearity) and tax revenue: (2) W (t) = fy + (1 t)wl (l)g + twl Since the agent has chosen l to maximize utility, the envelope condition implies that an increase in t has only a mechanical …rst-order e¤ect on the agent’ utility (i.e. s du dt = @u @t = wl). Hence, behavioral responses can be ignored when di¤erentiating the term in the curly brackets, yielding the following expression for the marginal excess burden of taxation: dW dt (3) = = t wl + wl + t dT I dt dW dt d[wl] dt where T I = wl is taxable income. Feldstein (1999) showed that = t dT I even in a model where dt Thus, individuals make a vector of decisions (l1 ,...,ln ), such as hours, training, and occupation. the taxable income elasticity ( dT I ) is a su¢ cient statistic to calculate deadweight loss in a general dt 4 multi-input labor supply model. I return to the multi-input case in greater detail in Section II.A. I.B Sheltering with a Resource Cost Suppose the individual can shelter e dollars of income from taxation by paying a resource cost g(e). The sheltering of e could be accomplished either through legal tax avoidance (e.g. setting up a trust) or illegal tax evasion (e.g. under-reporting). The cost g(e) re‡ ects the economic opportunity cost of sheltering e, i.e. the loss in total output from this behavior. For example, g(e) could re‡ ect the loss in pro…ts from transacting in cash instead of electronic payments or the cost of choosing a distorted consumption bundle to avoid taxes. The individual now chooses both labor supply (l) and how much income to shelter (e): (4) max u(c; l; e) = c l;e (l) g(e) t)(wl e) + e s.t. c = y + (1 Social welfare is: (5) W (t) = fy + (1 t)(wl e) + e (l) g(e)g + t(wl e) Since the individual chooses l and e to maximize the term in curly brackets, we can again ignore behavioral responses when di¤erentiating the term in the curly brackets. Hence we again obtain the Feldstein formula: (6) where T I = wl dW = dt [wl e] + [wl e] + t d[wl e] dT I =t dt dt e is reported taxable income. From an e¢ ciency perspective, it does not matter if taxable income falls with t because of a change in labor supply (l) or a reporting e¤ect (e). Intuitively, the agent optimally sets the marginal cost of reporting $1 less to the tax authority (g 0 (e)) equal to the marginal private value of doing so (t). Since the agent supplies labor up to the point where his marginal disutility of earning another dollar equals 1 t, the marginal social value of earning an extra dollar (net of disutility of labor) is t. Hence, the marginal social costs of reducing earnings and reporting less income are the same at the individual’ optimal allocation, s making it irrelevant for e¢ ciency which mechanism underlies the change in T I. This is the intuition underlying studies which argue that the taxable income elasticity is su¢ cient to calculate 5 deadweight loss even in the presence of evasion and avoidance. I.C Sheltering with a Transfer Cost The leading example of such Part of the cost of sheltering may re‡ a transfer across agents. ect transfer costs are …nes levied for tax evasion. A …ne has a private cost to the tax evader but has no social cost if the agent is risk neutral, because it simply redistributes output from the agent to the government.4 If the agent is risk averse, the disutility associated with the increased uncertainty To simplify the exposition, I analyze the created by random audits constitutes a resource cost. risk-neutral case in this subsection and also assume that audits have no administrative cost. In Appendix A, I show that the general formula for excess burden in section I.D accommodates these factors if the utility cost of added risk and administrative cost of audits are included when measuring the marginal resource cost g 0 (e). I model audit deterrence of evasion using a simple variant of the framework developed by Michael G. Allingham and Sandmo (1972). Suppose that an individual is audited with probability p(e), where dp=de > 0 – egregious under-reporting may lead to a higher chance of being caught. If caught, the individual must pay his tax bill plus a …ne F (e; t). Let z(e; t) = p(e)[te + F (e; t)] denote the expected private cost of evasion.5 Assume that z is a strictly convex function of e and that @z @e (0; t) = 0 and @z @e (wl; t) = 1 to guarantee an interior optimum in e. Aside from these regularity assumptions, the derivation that follows does not depend on the speci…cation of z(e; t). Hence, although I have given a micro-foundation for z(e; t) in terms of auditing for concreteness, the results below apply for any transfer cost z, i.e. any cost of sheltering which has a positive externality of equal size on another agent. I give other examples of transfer costs in section II.B. The individual chooses e and l to (7) max u(c; l; e) = c e;l (l) t)(wl e) + e z(e; t) s.t. c = y + (1 This agent’ problem is formally identical to that in (4). However, there is a key di¤erence in the s social welfare function, in which the z(e; t) transfer externality now appears twice (with opposite 4 Only cash …nes are transfers; imprisonment generates a social cost both for the criminal and the government, which must maintain the prison. The formula derived in section II.D allows for such resource costs of punishment. 5 If z(e; t) is linear in t (i.e. z(e; t) = tz(e; 1) for all t), the tax rate has no e¤ect on sheltering: de = 0 (Yitzhaki dt 1974). The excess burden calculations accommodate this case as well as any other speci…cation of z(e; t) because the properties of the cost function z are captured in the empirically estimated elasticities. 6 signs): (8) W (t) = fy + (1 t)(wl e) + e z(e; t) (l)g + z(e; t) + t(wl e) Exploiting the now familiar envelope condition for the term in curly brackets yields (9) dW dt @z @z @z de d[wl e] + (wl e) + + +t @t @t @e dt dt dT I @z de dLI de @z = t + =t + ( t) dt @e dt dt dt @e = (wl e) where LI = wl is total (pretax) earned income. To simplify this expression, consider the …rst-order condition for the individual’ choice of e: s (10) t= @z @e z 0 (e) Intuitively, the individual sets the marginal private bene…t of raising e by $1 (saving $t) equal to the marginal private cost. Combining (10) with (9) yields (11) dLI dW =t dt dt Equation (11) shows that, in the transfer cost model, the taxable income elasticity is not a su¢ cient statistic to calculate deadweight loss. Instead, excess burden is determined purely by the e¤ect of taxation on total earned income ( dLI ) –the e¤ect of taxes on “real”labor supply behavior.6 If T I dt responds to t only because of sheltering, there is no deadweight loss. Intuitively, the total size of the pie is una¤ected by sheltering in this model –the transfer cost z simply a¤ects how the pie is split. In contrast, in the resource cost model, the cost of sheltering g(e) is pure waste. I.D Resource and Transfer Costs Now suppose that sheltering e dollars of income from taxation requires payment of both a resource cost g(e) and a transfer cost z(e; t). The individual chooses e and l to maximize his utility net of 6 Slemrod (2001) proposes a model of evasion in which the expected …ne z depends on earnings (wl) as well as the amount evaded (e). For example, earning a higher income could increase the probability of audit or reduce the @z cost of hiding income. The qualitative results in this paper hold when @wl 6= 0. However, (11) has an added term @z @wl @z , re‡ ecting the e¤ect of earned income on the transfer cost. This is because 0 (l) = 1 t @wl @wl at the @wl @t @t optimum, changing the social cost of distortions in labor supply. 7 both resource and transfer costs: (12) max u(c; l; e) = c e;l (l) g(e) t)(wl e) + e z(e; t) s.t. c = y + (1 Social welfare is (13) W (t) = fy + (1 t)(wl e) + e z(e; t) (l) g(e)g + z(e; t) + t(wl e) The same derivation as in the previous subsection gives (14) dLI de @z dW =t + ( dt dt dt @e t) The di¤erence from the pure transfer cost model is that the …rst order condition now has an additional term relative to (10), re‡ ecting the marginal resource cost: (15) t = z 0 (e) + g 0 (e) This leads to a general formula for deadweight burden with transfer and resource costs: (16) (17) (18) where = g 0 (e) t dW dt dLI de g 0 (e) dt dt dT I dLI = tf + (1 ) g dt dt t = f T I"T I + (1 )wl"LI g 1 t = t = g 0 (e) z 0 (e)+g 0 (e) denotes the fraction of the total cost of sheltering accounted for by and "LI = dLI 1 t dt wl resource costs and "T I = dT I 1 t dt wl e denote the taxable income and total earned = 1, and (18) reduces to the income elasticities, respectively. When there is no transfer cost, standard taxable income formula for deadweight loss in (6). is no resource cost, At the other extreme, when there = 0, and the formula reduces to (11), which depends only on the earned 2 (0; 1), the marginal excess burden is determined is income elasticity. In the general case where by a weighted average of the taxable income and total earned income elasticities. The weight determined by the magnitude of resource costs of sheltering relative to the tax rate or, equivalently, the magnitude of resource costs relative to the total (resource plus transfer) costs of sheltering. 8 Estimating the earned income elasticity "LI requires a method of inferring total earnings empirically. Avoidance behaviors that generate tax deductions – such as contributions to a charity or trust – are generally reported on income tax returns. One can therefore construct a gross- of-avoidance measure of earned income from the tax data used to estimate the taxable income elasticity. Recovering total earnings in the presence of evasion requires additional data. Total earnings can be imputed from consumption data, as in Christopher A. Pissarides and Guglielmo Weber (1989). One can also directly estimate the e¤ect of tax rates on evasion ( @e ) using audit @t data, as in Charles T. Clotfelter (1983). If sheltering responses are much larger than “real” labor supply responses –as documented by Slemrod (1992, 1995) –an estimate of "LI is less important because excess burden can be approximated simply by multiplying "T I by . Why is it necessary to distinguish evasion and avoidance responses from changes in labor supply to calculate excess burden when reducing labor supply, w 0 < 1? The agent’ choice of l equates the marginal social cost of s (l), with the tax rate t. However, the agent’ choice of e equates the s marginal private cost of sheltering (g 0 +z 0 ) with the tax rate t. Because the marginal private cost of sheltering di¤ers from its social cost when < 1, the key condition underlying Feldstein’ formula s –that the marginal social cost of all behaviors equals the tax rate –is violated for sheltering. This forces us to separate the two behaviors to calculate excess burden. Further intuition for why the violation of the g 0 (e) = t condition makes II.A. Abstractly, it is not surprising that transfer costs a¤ect the formula for excess burden, since externalities always introduce additional terms in e¢ ciency calculations. Transfers are a subset of externalities where the agent making the sheltering decision bears a cost that is fully o¤set by a positive externality on other agents. Sheltering can also generate non-transfer externalities, such as the costs borne by the government for audits or imprisonment of tax evaders. Such non-transfer externalities a¤ect the social resource costs of sheltering. Equation (18) holds with non-transfer externalities as long as they are included when measuring g 0 (e).7 It does not matter whether the individual making the sheltering decision or other agents bear resource costs, because social welfare W (t) depends only on total resource costs to society. Unlike transfer externalities, however, negative non-transfer externalities generated by sheltering would make the standard taxable income formula in (3) understate deadweight loss. 7 dW dt a weighted average of "T I and "LI is given in section 9 I.E Optimization Errors The analysis thus far has highlighted transfer costs as a source of a wedge between marginal resource costs of sheltering and the tax rate. Such a wedge can also arise from misperceptions of the cost of sheltering. Surveys show that many individuals substantially overestimate audit rates and …nes associated with tax evasion (John T. Scholz and Neil Pinney 1993; Dick J. Hessing et al. 1992; Andreoni, Erard, and Feinstein 1998). Misinformed agents may not take full advantage of sheltering strategies despite their low marginal resource costs. Motivated by this evidence, I extend the analysis to allow for misperceptions and optimization errors in sheltering decisions. Suppose the agent perceives the resource cost of sheltering to be g (e) and the transfer cost of sheltering to be z (e; t). For example, in the auditing model of section b b II.C, if the agent’ perceived probability of being caught for evasion is p(e) and his perceived …ne s b b b is F (e), then z (e; t) = p(e)[te + F (e)]. The individual chooses l and e to maximize his perceived b b max u(c; l; e) = y + (1 e;l expected utility (19) t)(wl e) + e Social welfare is una¤ected by the individual’ perceptions and remains the same as in (13): s (20) W (t) = fy + (1 t)(wl e) + e z(e; t) (l(t)) g(e)g + z(e; t) + t(wl e) z (e; t) b (l) g (e) b Because the utility function is separable in l and e, l is e¤ectively chosen to optimize the term in the curly brackets even though e is not. Using the envelope condition for l and recognizing that e is not optimized, we obtain (21) (22) (23) where = g 0 (e) 8 t . dW dt = = t (1 t) dLI dt dT I = tf + (1 dt de de + dt dt de g 0 (e) dt ) g 0 (e) de dLI + t( dt dt de ) dt dLI g dt Equation (23), which is the most general formula for marginal excess burden presented in this paper, coincides with the formula obtained above in (18). The intuition is again g (e) When agents do not optimize, one cannot write = g0 (e)+z0 (e) because the …rst order condition g 0 (e) + z 0 (e) = t need not hold. Hence, resource costs should be normalized by the tax rate to calculate the weight in the general case where agents face transfer costs and do not optimize perfectly. 8 0 10 that the agent does not equate the marginal social cost of sheltering with the tax rate, making it necessary to weight de dt by g 0 (e) instead of t when calculating marginal excess burden. agent’ perceived transfer and resource costs. s Since (23) does not rely on any assumptions about g (e) or z (e; t), it holds irrespective of the b b More generally, one does not have to specify the positive model of behavior that drives the choice of e in order to derive (23). Hence, provided that we can measure g 0 (e), we do not need to have a complete explanation of why there is a gap between g 0 (e) and t to calculate marginal excess burden. This is an especially useful property because the model that explains observed evasion and avoidance behavior is debated (Andreoni, Erard, and Feinstein 1998). Several caveats must be kept in mind in implementing (23) when agents make optimization errors. First, the labor supply decision must be separable from the sheltering decision to obtain (23) –that is, the optimal choice of l must be invariant to the choice of e. Intuitively, l will not be set at the unconstrained optimum if e is not optimized and sheltering a¤ects the marginal return to work.9 Second, recent evidence suggests that individuals misperceive not only the …nes for tax evasion but also tax rates themselves. If agents choose l suboptimally, the …rst order condition for l does not hold and hence the t dLI component of (23) requires modi…cation. The formula dt can be extended to accommodate optimization errors with respect to tax rates using an estimate of the wage elasticity of labor supply ( dLI ), as shown by Chetty, Adam Looney, and Kory Kroft dw (2007). Finally, although a fully speci…ed model for why g 0 (e) di¤ers from t in equilibrium is not required, some understanding of the positive model that drives sheltering is needed to measure g 0 (e). For example, if agents do not evade taxes because of private ethical costs of doing so, one may include these costs when calculating g 0 (e). In contrast, if agents do not evade taxes because of misinformation about audit rates and …nes, the actual marginal resource cost g 0 (e) would be lower. I.F Implications for Optimal Taxation The preceding results have di¤erent implications for the measurement of deadweight loss and the determination of optimal taxes. The taxable income elasticity is always a necessary input for revenue and optimal tax calculations, irrespective of its relevance for excess burden calculations. To illustrate this point, I consider a simple Ramsey tax problem. Suppose that one dollar of 9 For example, separability is violated if the cost of evasion z depends on earnings wl or if there are income e¤ects in labor supply (Yitzhaki 1987). 11 government spending on public goods generates social bene…ts of 1 + , so that social welfare is (24) f W (t; ) = fy + (1 t)(wl e) + e z(e; t) (l(t)) g(e)g + z(e; t) + (1 + )(wl e)t This social welfare function nests that in sections I.A-I.E, where I assumed reproduce the compensating-variation measure of excess burden. When = 0 in order to = 0, the optimal tax The rate is trivially t = 0 since taxation generates no bene…t but creates an e¢ ciency cost. interesting case for optimal taxation is the situation where above gives f dW = dt t 1 t t 1 t > 0. A derivation analogous to that (25) f T I"T I + (1 )wl"LI g + T I(1 "T I ) (26) f It follows that the tax rate t that maximizes W (t) satis…es: t 1 t = ( + )"T I + (1 wl ) wl e "LI When 1 1+ "T I ). = 1, (26) collapses to the standard inverse-elasticity rule for optimal taxation ( 1 t t = When = 0, t is a function of both "T I and "LI , even though excess burden depends purely on "LI . This is because the relevant consideration for optimal tax design is the marginal cost of public funds –the deadweight cost generated per dollar of revenue raised: (27) M CP F (t) = dW=dt = dR=dt t 1 tf T I"T I + (1 e)(1 (wl "T I 1 t ) t )wl"LI g The optimal linear tax t equates the marginal cost of public funds with the marginal bene…t of public expenditure: M CP F (t ) = .10 The taxable income elasticity determines how much the tax rate must be raised to generate an additional dollar of tax revenue (the denominator of (27)), while the earned income elasticity a¤ects the marginal e¢ ciency cost of that tax increase (the numerator of (27)). The formula for dR dt is una¤ected by transfer costs or errors in optimization, and depends on "T I irrespective of . Thus, estimates of both "T I and "LI are required to analyze optimal taxation, even if sheltering has zero resource cost.11 10 The MCPF is given by (27) only in a Ramsey model with a linear tax. In non-linear income tax models, the formula for the MCPF is more complex, as shown by Dahlby (1998). However, the qualitative point that both "T I and "LI matter for optimal taxation applies in models of non-linear taxation. 11 A related point is that compliance costs borne by the individual must be distinguished from administrative costs of tax collection in determining t , because administrative costs enter the denominator of the M CP F whereas 12 Note that even if sheltering has no e¢ ciency cost, it could still be desirable to reduce sheltering from the perspective of optimal policy. Reducing tax evasion through sti¤er penalties could be a more e¢ cient way to generate revenue than raising distortionary taxes even if evasion has no resource cost (Louis Kaplow 1991, Joram Mayshar 1991). Again, additional factors beyond the marginal excess burden of raising t are relevant for the optimal policy problem. II II.A Discussion Foundations of Weighted Average Formula Why does relaxing the assumption that g 0 (e) = t lead to a formula for marginal excess burden that is a weighted average of "T I and "LI ? To obtain an answer this question, it is useful to analyze the e¢ ciency cost of income taxation at a more abstract level. Consider a model where the agent makes N choices fx1 ; :::; xN g that contribute additively to taxable income, so that total taxable P income is T I = xi . Choice xi has a convex, increasing social cost of gi (xi ). The government levies an income tax t on taxable income, so that social welfare is given by W = f(1 t) X xi X gi (xi )g + t X xi (28) In this model, the excess burden of increasing the income tax t can be expressed as the sum of each of the behavioral responses weighted by their marginal social costs: X dxi dW 0 = gi (xi ) dt dt i=1 N (29) This expression for marginal excess burden is the most robust available because it does not rely on agent optimization and permits arbitrary externalities in the N decisions. The shortcoming of this formula is that it is di¢ cult to separately estimate all behavioral responses and their total marginal social costs. Empirical implementation can be simpli…ed by making assumptions about the underlying positive model. Feldstein (1999) assumes that there are no externalities and that P 0 agents choose fxi gN optimally, so that gx (xi ) = t 8i. Under this assumption, dW = t N dxi = i=1 i=1 dt dt The present paper makes a weaker assumption: choices that a¤ect total earnings (LI) are made compliance costs do not (Slemrod and Yitzhaki 1996). t dT I . dt 13 optimally and do not generate externalities, but choices that create a di¤erence between total earnings and reported taxable income (T I LI) may be suboptimal and may generate externalities. P Suppose that choices 1 to n a¤ect total earnings, so that LI = n xi and choices n + 1 to N i=1 PN a¤ect only reported taxable income, so that total sheltering is e = T I LI = i=n+1 xi . Then dW dt = n X i=1 (30) dxi 0 gi (xi ) dt dxi + dt + i=n+1 N X 0 gi (xi ) (31) (32) PN dxi 0 i=n+1 gi (xi ) dt PN dxi i=n+1 dt = t dLI = t dt n X i=1 N X 0 gi (xi ) dxi dt i=n+1 dxi dt g 0 (e) de dt where g 0 (e) = denotes the mean marginal resource cost of sheltering. This derivation shows that the weaker assumption made here about the decisions underlying LI and T I directly leads to the weighted average formula for dW dt in (23). It also shows that (23) remains valid when earnings are a¤ected by multiple decisions (e.g. training, e¤ort, occupation) because "LI automatically aggregates all these behavioral responses. If these labor supply choices also have externalities, they should also be treated like sheltering decisions. For example, parents’ choice of labor supply may have an externality on children or executives’leisure choices may a¤ect their assistants’leisure. In these cases, one should weight dLI dt by its average marginal social cost dW dt . including all external e¤ects, instead of the tax rate, to calculate Thus, the formula for marginal excess burden proposed here is accurate when the important di¤erences between social and perceived private costs are in sheltering decisions rather than total earnings behavior. II.B Examples of Transfer Costs There are two types of transfer costs: transfers to the government (revenue o¤sets) or to other agents in the private sector. Fines for tax evasion are the simplest example of transfers to the government. A more important example in practice is the shifting of income between tax bases, e.g. from personal to corporate income (Gordon and Slemrod 2002). If corporate income is taxed at a rate tc , the agent e¤ectively pays a transfer cost of z(e) = tc e to the government to shelter e from income taxation through shifting. Private transfer costs can arise from penalties for evasion and avoidance imposed by other agents in the private sector. For example, a manager may be …red by shareholders if he is discovered 14 to be using illegal tax sheltering strategies, thereby losing a bonus. A …rm may lose clients to a competitor if it is identi…ed as a tax evader. An individual may lose his wealth to theft by holding it in the form of cash or hidden accounts. Penalties that deter evasion could also deter quasi-legal avoidance strategies –e.g. declaring a vehicle or travel expense as a “business expense”–since the border between avoidance and evasion is often ambiguous. Misperceptions of such penalties would increase perceived transfer costs. Avoidance strategies that are clearly legal can also be deterred by private transfer costs. The most important example is charitable contributions, whose bene…ts to recipients may not be fully internalized by the donor, e¤ectively creating a transfer cost (Kaplow 1995). Empirical studies have shown that charitable contributions are highly sensitive to tax rates (see e.g. Feldstein and Amy Taylor 1976, Clotfelter 1985), suggesting that a signi…cant part of the taxable income response observed in the data could be driven by this margin. Transfers within a family are another example. If an individual values his children’ consumption at less than $1 but the planner weights s all individuals equally, the individual e¤ectively incurs a transfer cost when sheltering money from taxation through trusts or inter-vivos transfers.12 Transfer costs can also arise indirectly within the private sector. Suppose an executive is deciding whether to compensate himself in the form of a taxed dollar of labor income or untaxed perks such as amenities in the o¢ ce (e.g. a better building, child care facilities, better company cars). Such perks have two forms of transfer costs. First, since an executive typically cannot take the perks with him to another job, some fraction of the bene…ts (in expectation) are transferred to subsequent executives. Second, some of the surplus from the o¢ ce amenities may have to be shared with other employees even contemporaneously – it is di¢ cult to improve only part of a building. In considering the e¤ects of indirect transfers, it is important to note that the only private transfer externalities that a¤ect the Feldstein (1999) formula are those which are not internalized by agents through Coasian bargaining. For example, if a manager renegotiates his contract with those a¤ected by his sheltering behavior (shareholders, employees, etc.), he e¤ectively faces no private transfer cost in sheltering because his salary can be adjusted to o¤set any externalities. In practice, transaction costs and information problems likely impede perfect state-contingent contracting, and some sheltering behaviors may therefore be deterred by indirect private transfer costs at the margin. Each of these examples also involves some resource costs. For instance, the e¤orts spent by charities or children on lobbying to receive a transfer constitutes a resource cost. 12 15 Moreover, Coasian bargaining cannot overcome the externalities involved in direct redistribution of money to charities or family (Kaplow 1995). II.C Comparison to Existing Formulas The formula proposed here is not the only method of accounting for government transfer costs (revenue o¤sets) when agents optimize fully. An alternative approach is to compare the mechanical change in total tax revenue (from all tax bases) with the actual change in total tax revenue (Auerbach 1985, Slemrod 1998): (33) where R = t(wl from audits) and dW dR = dt dt @R jl;e @t e) + z(e) denotes total tax revenue from all tax bases (including …nes collected @R @t jl;e denotes the mechanical change in tax revenue if l and e were unchanged. Saez (2004) proposes a variant of the revenue-distortion approach for the special case where agents shelter income by shifting earnings from the income to corporate tax bases. Saez’ approach s is to adjust Feldstein’ (1999) formula by adding a term re‡ s ecting the added revenue raised from the corporate tax via shifting: (34) dW = dt f t 1 t (wl e)"T I tc 1 t e"e g where tc is the corporate tax rate and "e = de 1 t dt e is the elasticity of income shifting from the income to corporate tax base with respect to the net-of-income tax rate. When agents optimize perfectly and there are no private transfer costs, the three formulas are just di¤erent representations of the same equation, and should yield exactly the same estimate of marginal excess burden. The three formulas di¤er in the types of data that they employ. To implement Slemrod’ and Saez’ revenue-adjustment formulas, one must identify all the behavioral s s responses through which total tax revenue is a¤ected following an income tax change. To implement the formula here, one must estimate the taxable and total income elasticities and the marginal resource cost of sheltering. In some applications, it may be easier to trace out revenue e¤ects, while in others it may be easier to estimate the marginal resource cost of sheltering. Hence, the three formulas should be viewed as complements for empirical applications. One bene…t of the formula here is that it sheds light on the types of taxes and behavioral responses that generate the largest e¢ ciency costs. 16 In particular, it shows that the deadweight cost of sheltering is proportional to its resource cost, irrespective of its e¤ects on other parts of the government’ budget. In addition, only the formula proposed here accounts for transfers within s the private sector and optimization errors in sheltering. This is because within-private-sector externalities and errors in perceived costs of sheltering do not show up on the government’ budget. s An instructive proof showing why the revenue-adjustment approaches cannot be applied in these situations is given in Appendix B. III Conclusion This paper has extended Feldstein’ (1999) taxable income formula for deadweight loss to an envis ronment in which agents’perceived private costs of evasion and avoidance di¤er from their social costs. The generalized formula shows that, to calculate the excess burden of a change in the income tax, one must determine (1) how much of the taxable income elasticity is driven by variation in labor supply choices vs. sheltering ("T I ; "LI ); and (2) the marginal resource cost of sheltering ( ). These factors are particularly important in understanding the e¢ ciency cost of taxing individuals who have substantial ability to shelter income, such as high-income and self-employed individuals. Characterizing the resource costs of reporting lower income is also important for topics beyond optimal income taxation. For example, Gruber and Joshua D. Rauh (2007) estimate the sensi- tivity of reported corporate pro…ts to corporate tax rates to calculate the excess burden of the corporate tax. If corporations’ reported pro…ts are sensitive to tax rates primarily because of reporting e¤ects and these changes in reporting do not have substantial resource costs, the excess burden from corporate taxation may be smaller than implied by Gruber and Rauh’ calculations. s Another example is Looney and Monica Singhal’ (2006) use of the taxable income measure to s estimate intertemporal substitution elasticities using anticipated changes in tax rates. Although individuals may shift their reported taxable income signi…cantly across periods to minimize their tax burdens, labor supply and total earnings could be less elastic intertemporally (Goolsbee 2000). Since aggregate output is what matters in calibrating macroeconomic models, the resource costs of intertemporal shifting must be quanti…ed in order to translate Looney and Singhal’ estimates into s the relevant intertemporal elasticity. Estimating the marginal resource cost of sheltering, g 0 (e), is especially important because its potential values span a very wide range. Many forms of sheltering appear to have low accounting costs relative to top marginal tax rates (approximately 40% in the U.S.). For instance, the accounting costs of conducting business in cash to under-report taxable income or setting up o¤shore 17 accounts, trusts, and foundations are typically less than 5% of the amount sheltered. There could, however, be substantial economic resource costs from such sheltering behaviors, such as the need to maintain an ine¢ ciently small …rm to facilitate under-reporting or the loss of control inherent in delegating money to trusts and foundations. One must also account for resource costs that arise from distortions in consumption patterns induced by tax-sheltering motives, such as overconsumption of tax-favored goods (e.g. housing and healthcare) or business-related expenses (e.g. ‡ ying in …rst class, having a lavish o¢ ce). Once these additional economic resource costs of sheltering are taken into account, g 0 (e) could potentially be close to the marginal tax rate. Depending on where g 0 (e) lies in the range from zero to the tax rate, the excess burden due to tax sheltering behavior could range from zero to the large values implied by existing studies of the elasticity of taxable income. To estimate the marginal resource cost of sheltering, one must develop a mapping between the primitive parameter g 0 (e) and observable behaviors (Chetty 2008). One promising approach to this problem is to compare the e¤ects of tax changes on reported income and consumption bundles, recognizing that real resource costs ultimately distort consumption patterns (Yuriy Gorodnichenko, Jorge Martinez-Vazquez, and Klara Sabirianova Peter 2008).13 Gorodnichenko et al. show that a large tax reform in 2001 in Russia induced substantial changes in reported taxable income but little change in the level and composition of consumption. Their …ndings suggest that the resource costs of changes in reported taxable income are small in the Russian case. Additional studies on the e¤ects of tax reforms on consumption would be valuable given the prevalence of evasion and avoidance even in the U.S.: recent studies estimate that the evasion tax gap in the U.S. is 15% of tax revenue and that the avoidance tax gap is also substantial (Slemrod and Yitzhaki 2002, Slemrod 2007). 13 The bene…t of the consumption measure is that it automatically aggregates resource costs across di¤erent types of sheltering behaviors. Without this aggregate measure, one would have to calculate the marginal resource cost for each sheltering behavior individually and compute the weighted mean value to arrive at the relevant value of g 0 (e), as in (32). The disadvantage of the consumption measure is that it does not capture non-monetary costs, such as the cost of violating ethical principles by evading taxes. 18 References Allingham, Michael G., and Agnar Sandmo. 1972. “Income Tax Evasion: A Theoretical Analysis.” Journal of Public Economics, 1(3-4): 323-338. Andreoni, James, Brian Erard, and Jonathan Feinstein. 1998. Journal of Economic Literature, 36: 818-860. “Tax Compliance.” Auerbach, Alan J. 1985. “The Theory of Excess Burden and Optimal Taxation.” In Vol. 1, Handbook of Public Economics, ed. Alan J. Auerbach and Martin S. Feldstein, 67-127. Amsterdam: Elsevier Science Publishers. Auerbach, Alan J., and James R. Hines Jr. 2002. “Taxation and economic e¢ ciency.” In Vol. 3, Handbook of Public Economics, ed. Alan J. Auerbach and Martin S. Feldstein, 1347-1421. Amsterdam: Elsevier Science Publishers. Chetty, Raj. 2008. “Su¢ cient Statistics for Welfare Analysis: A Bridge Between Structural and Reduced-Form Methods.” UC-Berkeley mimeo. Chetty, Raj, Adam Looney, and Kory Kroft. 2008. “Salience and Taxation: Theory and Evidence.” American Economic Review. Forthcoming. Clotfelter, Charles T. 1983. “Tax Evasion and Tax Rates: An Analysis of Individual Returns.” Review of Economics and Statistics, 65(3): 363-373. Clotfelter, Charles T. 1985. Federal Tax Policy and Charitable Giving. Chicago: University of Chicago Press. Dahlby, Bev. 1998. “Progressive taxation and the social marginal cost of public funds.” Journal of Public Economics, 67(1): 105-122. Feldstein, Martin S. 1995. “The E¤ect of Marginal Tax Rates on Taxable Income: A Panel Study of the 1986 Tax Reform Act.” Journal of Political Economy, 103(3): 551-572. Feldstein, Martin S. 1999. “Tax Avoidance and the Deadweight Loss of the Income Tax.” Review of Economics and Statistics, 81(4): 674-680. Feldstein, Martin S. 2006. “The E¤ect of Taxes on E¢ ciency and Growth.” Tax Notes, 111(6): 679-684. Feldstein, Martin S., and Amy Taylor. 1976. “The Income Tax and Charitable Contributions.” Econometrica, 44(6): 1201-1222. Goolsbee, Austan. 1999. “Evidence on the High-Income La¤er Curve from Six Decades of Tax Reform.” Brookings Papers on Economic Activity, 30(1999-2): 1-64. Goolsbee, Austan. 2000. “What Happens When You Tax the Rich? Evidence from Executive Compensation.” Journal of Political Economy, 108(2): 352-378. Gordon, Roger H., and Joel B. Slemrod. 2002. “Are ‘ Real’ Responses to Taxation Simply Shifting Between Corporate and Personal Tax Bases?”In Does Atlas Shrug? The Economic Consequences of Taxing the Rich, ed. Joel Slemrod. Cambridge: Harvard University Press. Gorodnichenko, Yuriy, Jorge Martinez-Vazquez, and Klara Sabirianova Peter. 2008. “Myth and Reality of Flat Tax Reform: Micro Estimates of Tax Evasion and Productivity Response in Russia.” NBER Working Paper 13719. Gruber, Jonathan, and Emmanuel Saez. 2002. “The Elasticity of Taxable Income: Evidence and Implications.” Journal of Public Economics, 84(1): 1-32. 19 Gruber, Jonathan, and Joshua D. Rauh. 2007. “How Elastic is the Corporate Income Tax Base?” In Taxing Corporate Income in the 21st Century, ed. Alan J. Auerbach, James R. Hines, and Joel B. Slemrod. Cambridge: Cambridge University Press. Hessing, Dick J., Hank El¤ers, Henry S.J. Robben, and Paul Webley. 1992. “Does deterrence deter? Measuring the e¤ect of deterrence on tax compliance in …eld studies and experimental studies.”In Why people pay taxes: tax compliance and enforcement, ed. Joel Slemrod. Ann Arbor: University of Michigan Press. Joint Economic Committee, U.S. Congress. 2001. “Economic Bene…ts of Personal Income Tax Rate Reductions.” Kaplow, Louis. 1991. “Optimal Taxation with Costly Enforcement and Evasion.”Journal of Public Economics, 43(2): 221-236. Kaplow, Louis. 1995. “A Note on Subsidizing Gifts.” Journal of Public Economics 58(3): 469-477. Lindsey, Lawrence B. 1987. “Individual Taxpayer Response to Tax Cuts: 1982-1984, with Implications for the Revenue Maximizing Tax Rate.” Journal of Public Economics, 33(2): 173-206. Looney, Adam, and Monica Singhal. 2006. “The E¤ect of Anticipated Tax Changes on Intertemporal Labor Supply and the Realization of Taxable Income.” NBER Working Paper 12417. Mayshar, Joram. 1991. “Taxation with Costly Administration” Scandinavian Journal of Economics, 93(1): 75-88. Pissarides, Christopher A., and Guglielmo Weber. 1989. “An Expenditure-Based Estimate of Britain’ Black Economy.” Journal of Public Economics, 39(1): 17– s 32. Saez, Emmanuel. 2004. “Reported Incomes and Marginal Tax Rates, 1960-2000: Evidence and Policy Implications.” In Vol. 18, Tax Policy and the Economy, ed. James Poterba. Cambridge: MIT Press. Scholz, John T., and Neil Pinney. 1995. “Duty, Fear, and Tax Compliance: The Heuristic Basis of Citizenship Behavior.” American Journal of Political Science, 39(2): 490-512. Slemrod, Joel B. 1992. “Do Taxes Matter? Lessons from the 1980s.” American Economic Review, 82(2): 250– 256. Slemrod, Joel B. 1995. “Income Creation or Income Shifting? Behavioral Responses to the Tax Reform Act of 1986.” American Economic Review, 85(2): 175-180. Slemrod, Joel B. 1998. “Methodological Issues in Measuring and Interpreting Taxable Income Elasticities.” National Tax Journal, 51(4): 773-788. Slemrod, Joel B. 2001. “A General Model of the Behavioral Response to Taxation.” International Tax and Public Finance, 8: 119– 128. Slemrod, Joel B. 2007. “Cheating Ourselves: The Economics of Tax Evasion.” The Journal of Economic Perspectives, 21(1): 25-48. Slemrod, Joel B., and Shlomo Yitzhaki. 1996. “The Cost of Taxation and the Marginal E¢ ciency Cost of Funds.” International Monetary Fund Sta¤ Papers, 43(1): 172-198. Slemrod, Joel B., and Shlomo Yitzhaki. 2002. “Tax avoidance, evasion, and administration.” In Vol. 3, Handbook of Public Economics, ed. Alan J. Auerbach and Martin S. Feldstein, 1423-1470. Amsterdam: Elsevier Science Publishers. Yitzhaki, Shlomo. 1974. “A Note on ‘ Income Tax Evasion: A Theoretical Analysis.’ Journal ” 20 of Public Economics, 3(2): 201-202. Yitzhaki, Shlomo. 1987. “On the Excess Burden of Tax Evasion.” Public Finance Review, 15(2): 123-137. 21 Appendix A. Auditing Model with Risk Averse Agent and Administrative Costs This appendix extends the analysis in section I.D in two ways. First, it models risk bearing as an excess burden of tax evasion, as in Yitzhaki (1987). Second, it shows how resource costs of auditing or …nes can be taken into account. Consider an economy populated by a set of identical agents of measure 1. Let each agent’ utility s be denoted by u(c), which is increasing and concave. Agents are audited with probability p(e), and the probability of audit independent across agents. In the state where an agent successfully evades taxes and is not audited, his income is y + (1 t)(wl e) + e, where y represents unearned income. In the state where he is audited and caught, his income is y + (1 t)wl F (e) where y denotes unearned income. Suppose that the government bears a resource cost of auditing given e by an increasing, convex function K1 (p(e)) and a resource cost of imposing …nes (e.g. running a e e e prison) given by an increasing, convex function K2 (F (e)). Let K(e) = K1 (p(e)) + K2 (F (e)) denote total administrative resource costs as a function of the amount of evasion. Let V (t; y) denote the agent’ expected utility as a function of the tax rate and unearned income s and E(t; V ) denote the corresponding expenditure function. Following Auerbach (1985), I de…ne excess burden using the compensated variation measure: (35) where R(t) = t(wlc (t) ec (t)) + p(ec (t))[tec (t) + F (ec (t))] K(ec (t)) is tax revenue net of administrative costs and lc (t) and ec (t) are income-compensated Hicksian demand functions. Given the continuum of agents, tax revenue is deterministic. To calculate excess burden, assume that tax revenue is returned lump-sum to every agent irrespective of whether he is audited or not. Our objective is to derive an elasticity-based expression for marginal excess burden, dEB . To dt begin, observe that the agent’ indirect utility function is s (36) V (t; y) = max(1 e;l EB = E(t; V (0; y)) y R(t) p(e))u(y + (1 t)(wl e) + e) + p(e)u(y + (1 t)wl F (e)) (l) The expenditure function is (37) E(t; V ) = min y + (V e;l; (1 p(e))u(y +(1 t)(wl e)+e) p(e)u(y +(1 t)wl F (e))+ (l)) where denotes the multiplier on the utility constraint. Let ch denote consumption in the “good” state (where the agent is not caught) and cl denote consumption is the state where he is caught and Eu0 (c) = (1 p(e))u0 (ch ) + p(e)u0 (cl ) denote the expected marginal utility of consumption. Using the …rst order conditions from agent optimization, it is easy to see that = Eu1(c) . Note 0 that ch cl = te + F . Hence (38) (39) dE dt dR dt = 1 f(1 Eu0 (c) p(e))u0 (ch )(wl e) + p(e)u0 (cl )wlg K 0 (e)) = wl e + p(e)e + t d[wlc ] dec + ( t + p0 (e)[te + F ] + p(e)[t + F 0 (e)] dt dt 22 It follows that (40) dEB dt = t d[wlc ] [u0 (ch ) u0 (cl )] + p(e)(1 p(e))e dt Eu0 (c) dec [( t + p0 (e))(te + F ) + p(e)(t + F 0 (e)) + dt K 0 (e)] To simplify the third term in this expression, consider the agent’ …rst order condition with respect s to the choice of e: (41) t = p0 (e) [u(ch ) u(cl )] [u0 (ch )t + u0 (cl )F 0 (e)] + p(e) u0 (ch ) u0 (ch ) Using this expression and collecting terms, we obtain (42) dEB dt = t d[wlc ] [u0 (ch ) u0 (cl )] + p(e)(1 p(e))e dt Eu0 (c) c u(ch ) u(cl ) u0 (ch ) u0 (cl ) de 0 [p (e)fch cl g + p(e)F 0 (e)f g + dt u0 (ch ) u0 (ch ) K 0 (e)] To clarify the intuition for this formula, I take a quadratic approximation to the utility function u00 (c) around ch and write the formula in terms of the coe¢ cient of relative risk aversion, = u0 (c) c: (43) dEB dt t d[wlc ] dt fp(e)(1 p(e))e c g c dec c 1 0 c fp(e)F 0 (e) + p (e)ch ( )2 dt c 2 c K 0 (e)g where cc = te+F denotes the percentage loss in private income when the agent is caught. Note ch that when = 0 (risk neutrality), the last two terms drop out of (43) and it coincides with (11) as expected. The second term re‡ ects the cost of the additional risk directly from the increase in the tax rate, which raises the di¤erence in income between the two states in proportion to the amount of tax evaded e. The third term re‡ ects the cost of the additional risk that the agent bears from one dollar of additional evasion, due to the increased …ne and probability of audit. Both of these additional risk costs constitute real resource costs because they reduce net social welfare. De…ne the marginal resource cost of evasion as g 0 (e) = fp(e)F 0 (e) cc + 1 p0 (e)ch ( cc )2 K 0 (e)g and 2 0 the marginal resource cost directly from the increase in the tax rate as gt (e) = fp(e)(1 p(e))e cc g. Then we can rewrite (43) as dW dt dEB d[wlc ] dec 0 =t g 0 (e) gt (e) dt dt dt t f "c I (wl e) + (1 )"c wlg T LI 1 t (44) (45) = = 0 gt (e) where "c I denotes the compensated taxable income elasticity and "c denotes the compensated T LI total earned income elasticity. This equation coincides with the formula for the general case with 0 resource and transfer costs in (18) except for the additional term gt (e) re‡ ecting the direct cost of subjecting the agent to more risk when t is increased. Excess burden still depends on a weighted average of the (compensated) taxable income and total earned income elasticities. The weight is still proportional to g 0 (e), which is now rede…ned to include the risk-bearing costs of evasion and administrative costs paid by the government. 23 Appendix B. Necessary Conditions for Revenue-Adjustment Formulas In the general case with resource costs, transfer costs, and optimization errors, recall from section I.E that (46) dLI dW =t dt dt g 0 (e) de dt To understand the correspondence between this formula and the revenue-adjustment approaches of Slemrod (1998) and Saez (2004), it is useful to distinguish between three cases. 1. Revenue o¤set with agent optimization. Suppose z(e) accrues to the government, so that R = t(wl e) + z(e). Then @R jl;e = wl e while dR = wl e + t d(wl e) + @z de . If the agent @t dt dt @e dt optimizes his sheltering decision, g 0 (e) = t z 0 (e). Hence the standard tax revenue adjustment formula holds: (47) dW dR = dt dt @R jl;e @t 2. Private transfer cost with agent optimization. Suppose z(e) is a private transfer to another agent in the economy. Then dR = wl e + t d(wl e) , and we obtain dt dt (48) dW dR = dt dt @R de jl;e + z 0 (e) @t dt where the additional term re‡ ects the externality transfer within the private sector that occurs through the agent’ behavioral response to the tax change. The revenue adjustment formula does s not hold with private transfers. 3. Revenue o¤set with optimization error in sheltering. As in case 1, z(e) accrues to the government, so that R = t(wl e) + z(e), and dR = wl e + t d(wl e) + @z de . But we cannot equate g 0 (e) dt dt @e dt with t z 0 (e) because the agent’ …rst-order-condition for e does not hold. Hence the simplest s revenue-adjustment formula that can be obtained is (49) dW dR = dt dt @R jl;e + (t @t z 0 (e) g 0 (e)) de dt where the additional term re‡ ects the gap between the tax rate and the sum of the transfer and resource costs caused by the agent’ optimization error. In the case where the agent optimizes, s t z 0 (e) g 0 (e) = 0, and the formula collapses to the standard revenue-adjustment formula. These derivations establish that the revenue adjustment formula holds if and only if all transfer costs are paid to the government and agents optimize their sheltering decisions. The Saez formula in (34) is an alternative representation of the revenue adjustment formula in (33) for the special case with two tax bases, and hence this result applies to that formula as well. 24