-
Notifications
You must be signed in to change notification settings - Fork 4
/
Copy pathmaster.tex
3080 lines (2737 loc) · 133 KB
/
master.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
% (c) Jyotirmoy Bhattacharya, 2013
% Email: [email protected]
%
% This work is licensed under a
% Creative Commons Attribution-ShareAlike 3.0 Unported
% License,
% http://creativecommons.org/licenses/by-sa/3.0/deed.en_US}.
%
%
\documentclass[11pt,reqno,openany]{amsbook}
\title{Macroeconomics}
\input{preamble}
\author{Jyotirmoy Bhattacharya}
\address{Ambedkar University Delhi}
\email{[email protected]}
\date{\today}
\begin{document}
\frontmatter
\input{titlepage}
\tableofcontents
\mainmatter
\chapter{Introduction}
Macroeconomics tries to understand the dynamics of economic
aggregates like national income, the price level or the rate of
unemployment. Our interest in macroeconomics arises from the
following questions about modern capitalistic economies:
\begin{enumerate}
\item Economies and groups of economies go through periods of general
underutilisation of resources. Unemployment coexists
with unutilised plant and machinery. This is evidently
inefficient since putting the unemployed workers to work
on the unused machines would produce additional output
that can make at least some people better off without
making anyone worse off. Yet, in a slump the market
mechanism does not seem to work towards eliminating this
inefficiency. At least not fast enough. The Great Depression
was the most dramatic of such episodes but smaller
slumps occur quite regularly. Is this periodic
inefficiency intrinsic to a capitalistic economy or can it
be eliminated without any major changes in the structure
of the economy?
\item Economies differ dramatically in their average
standard of living and there is no systematic tendency for
this gap to close. What are the economic forces that make
some countries rich and others poor? Why haven't the poor
countries been able to close this gap by accumulating
capital and adapting the technology available to rich
countries?
\item Governments think, and citizens often agree, that the
two problems above can be at least mitigated through the
choice of appropriate public policies. Is this really so?
And if so, which policies should be adopted in which
circumstances?
\end{enumerate}
There exist diverse ways of approaching these questions. The
present book follows the `neoclassical' approach which is
currently the most popular.
The first feature of this approach, as it applies to
macroeconomics, is that we start with households, firms and
governments as our basic units of analysis. Each household,
firm and government is composed of diverse individuals who
interact with each other in complex ways. In macroeconomics
we usually ignore this interaction and consider each of
these units as a black box.
Second, we assume that each household and firm maximises a
well-defined objective function subject to the constraints
imposed on it by the institutional framework of the economy.
This is a major assumption. First, contrary to soap operas
and corporate thrillers, we assume that conflicts of
interests between the different individuals constituting a
household or firm work themselves out in a way that the unit
has a whole appears to be pursuing a coherent goal. Second,
we assume that regardless of the complexity of the
environment facing the unit it can rank all the alternatives
available to it and choose the best. Thus there is no limit
to the information processing and decision making
sophistication of the economic units.
This assumption certainly captures important aspects of
reality. Economic decisions are certainly goal-oriented and
often when the stakes are high we spend considerable effort
in trying to determine which choice is the best. Yet, both
introspection and systematic research shows that we are not
really the superoptimizers of the last paragraph. Faced with
complex situations we fall back to using simple rules of
thumb rather than carrying out the impossibly complex task
of finding the best alternatives. Our decisions are often
subject to unconscious biases. Incorporating these
departures from full optimization into economic modes is
among the most active areas of current research. However,
this research is yet to reach a consensus. Therefore, in
this text we limit ourselves for the most part to models
based on full optimization.
Third, in the neoclassical approach we look at
\emph{equilibrium} states---states where the desired actions
of different agents are all consistent with each other. The
exact form of this equilibrium condition depends on the
particular institutional structure being studied. In
competitive markets it takes the form of the equality of
demand and supply. Where strategic interactions are
important we use equilibrium concepts from game theory, the
most important of which is that of Nash equilibrium.
The justification for limiting our attention to equilibrium
states is that in any other state some agents will find that
they cannot carry out their plans or that their plans do not
have the expected outcomes. This will make them change their
behaviour. Thus a non-equilibrium state cannot persist.
This does not by itself imply that an equilibrium will
ultimately come about. The system may keep moving from one
non-equilibrium state to another forever. Only if we think
that this is unlikely and that a system away from
equilibrium will move close to equilibrium rapidly enough
are we justified in studying only equilibrium states.
\section*{References}
Other graduate-level treatments of macroeconomics
in the neoclassical paradigm,
in order or increasing difficulty, are:
\cite{romer-advmacro}, \cite{ljungqvist-sargent},
\cite{slp}. Introductions to other approaches are
\cite{davidson-pkmt} and \cite{taylor-reconstruct}.
\cite{kahneman-tfs} is a
popular account of the psychology of decision-making.
\chapter{The AS-AD model}
In this chapter we begin our study of short-run fluctuations
by reviewing the AS-AD model that you must have already
encountered as an undergraduate. We assume that the economy
is closed.
\section{Background}
Throughout we assume that there is a single produced good in
terms of which we measure real output and expenditure and a
single labour market. We also assume that there are only two
assets---money and bonds---and a single nominal interest
rate which measures the return from bonds.
The AS-AD model analyses the economy in a single time period
during which we assume that the stock of capital and the
state of expectations remain unchanged. We refer to this by
saying that AS-AD is a model of the ``short-run''.
\section{Aggregate Demand}
The demand for goods ($Z$) is made up of consumption ($C$),
investment ($I$) and government expenditure ($G$). All these
variables are measured in real terms.
We take $G$ to be given exogenously.
Households decide how much to consume based on their
current disposable income and wealth, expectations of future
disposable income and current and future needs. Of these, all
variables other than current disposable income are held
constant in the short-run. So we can write $C=C(Y-T)$ where
$Y$ is current income and $T$ is net taxes. We assume that
$T$ is given exogenously.
Firms decide how much to invest based on the current
level of the capital stock, the current and future levels of
output and the real rate of interest.
The economic story for investment is that the desired level
of capital stock depends positively on present and future
expected profitability and negatively on the real rate of
interest (since a higher real rate of interest implies that
a unit of real output in the future is worth relatively less
in terms of present output). The higher the gap between
desired and actual capital stock, the higher is the rate of
investment. Once again suppressing the variables fixed in
the short-run we have $I=I(Y,i-\pi^e)$ where $i-\pi^e$, the
difference between the nominal rate of interest and expected
inflation, equals the real rate of interest.
In equilibrium, the total output of goods must equal the
total demand for goods, that is it must be the case that
\begin{equation}
\label{eq:asad-is}
Y = C(Y-T)+I(Y,i-\pi^e)+G
\end{equation}
The satisfaction of this equation in necessary for goods
market equilibrium, but it is not sufficient. We have not
yet discussed the supply decision of firms and therefore it
is not yet clear that firms would want to supply a quantity
of output that would satisfy this equation.
Given our assumptions the $(Y,i)$ combinations which
satisfy~\eqref{eq:asad-is} form a downward-sloping curve in
the $(Y,i)$ space. It is known as the IS curve.
Looking at asset markets we assume that the demand for money
takes the form
\[M^d = PL(Y,i)\]
where $P$ is the price level. The demand for money is
increasing as a function of $Y$ since a higher level of
output also implies a higher volume of transactions and
hence a higher demand for money to finance those
transactions. It is decreasing as a function of $i$ since by
holding money consumers must forego the interest earnings on
bonds and the higher this opportunity cost the more consumers
would economise on the holding of money.
The strict proportionality between the demand for money and
the price level needs comment. Imagine a doubling of all
current prices and wages. Assuming a fixed expected
inflation rate this also implies a doubling of all expected
future prices and wages. As a result the real opportunities
available to economic agents remain unchanged. We therefore
believe that agents would carry out the same real
transactions. This assumption that real demands and supplies
depend only on real opportunities and not on nominal
quantities is known as the lack of money illusion. But
given that prices and wages have doubled the old level of real
transactions would now require exactly double the amount of
money as before to carry out.
The supply of money ($M$) is assumed to be exogenous. Equality of
the supply and demand for money give
\begin{equation}
\label{eq:asad-lm}
M/P = L(Y,i)
\end{equation}
For a given value of $P$ this is an upward-sloping curve in
the $(Y,i)$ space called the LM curve.
For each possible value of $P$ the intersection of the IS
and LM curves (or what is the same thing, the simultaneous
solution of~\eqref{eq:asad-is} and~\eqref{eq:asad-lm}) gives
us unique values of $Y$ and $i$. We can express them as
$Y(P)$ and $i(P)$. $Y(P)$ is decreasing and $i(P)$ is
increasing since an increase in $P$ moves the LM curve
leftward.
The locus $Y(P)$ in the $(Y,P)$ space is called the
aggregate demand (AD) curve. Though the names are similar,
this curve is very different from the demand curve for a
single good that we study in microeconomics.
First, the AD curve \emph{does not} show the quantity
demanded of the single good in the economy for a given price
while everything else is being held constant. If you recall,
both consumption and investment demand depend on the level
of current income. But the level of income \emph{is not} held
constant while deriving the IS curve. Rather the IS curve is
the locus of points where income is chosen such that demand
equals output. Thus the AD curve is seen as a locus of
$(Y,P)$ pairs that are consistent with equilibrium in the
goods and asset markets.
Second, the demand curve for a single normal good slopes
downwards because an increase in price decreases the real
income of consumers (whose money income is assumed to be
given) and gives them an incentive to substitute away from
the good whose price has increased. The reason for the AD
being downward sloping is entirely different. The AD curve
slopes downward because at a higher price the demand for
money is lower, which causes the LM curve to shift
leftward, decreasing the level of $Y$ where the IS and the
LM curves intersect.
\section{The AS curve}
In deriving the IS curve~\eqref{eq:asad-is} we mentioned
that goods market equilibrium additionally requires
consideration of the supply decision of firms.
One important aspect of supply decisions in the real world
is that money wages and prices are ``sticky'', i.e.\ they
don't immediately adjust fully to changes in economic
condition.
Starting from this observation we may make the simple
assumption that in the short-run wages and prices remain
fixed at whichever levels they were set at in the past and
that firms supply the level of output demanded at the given
prices. Under this assumption the $(Y,P)$ combinations
consistent with the supply decisions of the firms is a
horizontal line in the $(Y,P)$ space---the so-called
``Keynesian AS curve''---and
the intersection of this curve with the AD curve gives us
the equilibrium level of $Y$ and $P$. Since the level of $P$
is exogenously given, we may have as well derived the
equilibrium level of $Y$ from the IS-LM apparatus by drawing
the LM curve corresponding to this $P$.
Even if the exogeniety of wages and prices were to be
accepted, the ``Keynesian AS curve'' is not consistent with perfect
competition in the goods and labour markets. Perfectly
competitive firms choose the amount of labour that they
employ and hence the output that they produce by equating the
marginal product of labour to real wage. Similarly in
perfectly competitive labour markets the supply of labour is
determined by workers equating their marginal disutility
from labour to the real wage. For a given level of prices
and wage, the output and employment
implied by the profit maximisation decision of firms, the
utility maximisation decisions of workers and by the
intersection of the AS-AD curve may all three be different.
In his \textit{General Theory} (\cite{keynes-gt}) Keynes
actually allowed the level of prices to be flexible and
required that it adjust in order to ensure that the profit
maximising output of firms equal the output at which IS and
LM intersected. However, he argued that in general the
economy might be in equilibrium even when the real wage is
not equal to the marginal disutility of labour.
Keynes's framework runs into problems. First
prices are in fact sticky. They do not change immediately
when aggregate demand and output changes. Second, if we
assume diminishing marginal productivity of labour and a
real wage equal to the marginal productivity of labour then
we would expect that real wages would decrease when output
increases and vice-versa, i.e.\ real wages would be
countercyclical. However, empirical studies of real wages do
not find this countercyclical pattern. Finally, it is not
made clear why money wages would not fall in a situation of
excess supply of labour, even if we were to accept, as
Keynes seems to argue, that a fall in money wages would not
restore equilibrium.
As a result, nowadays instead of perfect competition and
flexible prices we work with models of sticky prices and
wages and use
the existence of imperfect competition in the goods and
labour markets to explain why firms and workers may be
willing to accommodate changes in demand at unchanged prices
and wages. The ``Keynesian AS curve'' can be thought of as a
simplified expression of these theories.
However this is too drastic a simplification since it makes
$P$ and $W$ entirely exogenous and therefore does not allow
us to discuss how these variables respond to economic
changes. So, for example, we cannot discuss as important an
issue as inflation in this framework.
Yet, even wages and prices which are ``sticky'' are not
fixed for ever. They do adjust with time. Firms and workers
do reset prices and wages from time to time and when they do
so they take into account the current economic conditions as
well as their expectations of future economic condition. One
observation common to many models is that higher levels of
economic activity are, other things being constant,
associated with higher prices. This is the basis of the
upward-sloping AS curves that you can find in undergraduate
textbooks. We shall not derive such a curve here (though see
the exercises) but postpone the discussion to later chapters
where we can approach it with better tools.
\section{The Way Forward}
The IS-LM and AS-AD models are still the models many
macroeconomists reach for when first trying to understand
questions related to aggregate fluctuations. But they have
shortcomings.
First, being limited to a single short-run does not allow us
to discuss the evolution of macroeconomic variables and the
effects of policies over time. For example, if we want to
understand the effects of a permanent increase in the tax
rate it is not enough to know what happens in the period in
which the increase is imposed. We would also like to know
the impact as the continued increase affects expectations
and asset choices in the economy, something which we cannot
do within the AS-AD model.
Limiting our study to a
single short-period also creates an
artificial separation between the study of growth and
fluctuations. The long-run over which growth happens is
stitched from a sequence of short-runs and at least in
principle our short-run theories must be consistent with our
long-run ones.
Second, even in a short-run the assumption that expectations
are exogenously fixed is often not plausible. While
capital stocks can only adjust slowly over time because of the
finite speed of the physical processes involved, there is no
such friction holding back changes in human beliefs.
Participants in the economy continuously revise their
beliefs in the light of new information. The kind of policy
interventions---such as changes in government expenditure or
the money stock---that we study using the AS-AD framework
also convey new information to agents and very likely change
their beliefs. Therefore there is always the likelihood of
error in studying the effects of policy changes like these
while holding beliefs constant.
In the rest of this book we study the behaviour of
firms and households in greater detail
than we have done in this chapter. We will take up issues
like the role of credit market imperfections in determining
consumption and investment or how job market search and
asymmetries of information make the labor market so
different from competitive commodity markets.
In developing our models we shall also make the role of
stocks and expectations explicit. Combining these models
with assumptions regarding the evolution of stocks and
expectations will then also enable us to go beyond a
single-short run and address the limitations of the AS-AD
model discussed above.
\section*{Exercises}
\begin{excr}
From your favourite undergraduate texts find at least
three different derivations of an upward-sloping AS curve.
State precisely the assumptions regarding firm and worker
behaviour used in each.
\end{excr}
\begin{excr}
Explain the difference between the real and the nominal
rate of interest. In the text we claimed that the real
rate of interest is the nominal rate of interest minus the
expected rate of inflation. Why is this so?
\end{excr}
\begin{excr}
We derived the IS-LM model holding the stock of money as
fixed. However, most central banks nowadays set the
nominal interest rate and allow the stock of money to
adjust so as to equal the demand for money. How would you
modify the IS-LM model to incorporate this fact? (For more
on this see \cite{romer2000keynesian}.)
\end{excr}
\section*{References}
The AS-AD model is covered in most undergraduate texts.
It turns out that setting up an AS-AD model in a way that is
logically and economically consistent is somewhat tricky and
there are different modelling choices that can be made. See
\cite{dutt2002aggregate} for a history of the model and
alternative ways of setting it up. For critiques, see
\cite{barro1994aggregate} and \cite{colander1995stories}.
\chapter{Consumption: Certainty}
Savings is not an end in itself. Rather savings is the means
a household adopts in order to provide for future
needs. Therefore we can hope to be able to better analyse
and predict consumption-saving behaviour if instead of
thinking of the household as choosing between consumption
and savings we think of the household as choosing between
fulfilling present needs and fulfilling future ones. This is
the starting point of the intertemporal theory of
consumption. To more clearly see this central tradeoff
between the present and the future we begin by assuming in
this chapter that the household faces no uncertainity
regarding future needs and opportunities. Of course, in
reality uncertainty has an important influence on
intertemporal choices and we extend our discussion to take
it into account in Chapter~\ref{ch:cons-uncertain}
\section{Two-period case}
\subsection{Budget constraint}
Consider a consumer who lives for two periods, has an
endowment of $y_1$ and $y_2$ units of goods in the two
periods respectively and can borrow and lend any amount that
they like at the real rate of interest $r$.
Suppose the consumer consumes $c_1$ in the first
period. Then she will have to take a loan of $c_1-y_1$ to
finance her consumption. (This number can be negative, in
which case the consumer is lending rather than borrowing.)
In the next period the consumer will therefore have to make
loan repayments of $(1+r)(c_1-y_1)$. Assume that the
consumer does not want to make any bequests and cannot die
with any outstanding loans, consumption in the second period
must be,
\[c_2=y_2-(1+r)(c_1-y_1)\]
Simplifying and rearranging we have
\begin{equation}\label{eq:two-period-budget}
c_1+\frac{c_2}{1+r}=y_1+\frac{y_2}{1+r}
\end{equation}
This is the budget constraint faced by the consumer. We can
interpret this to mean that the present value of the
consumer's consumption stream must equal the present value
of their incomes.
\subsection{Utility maximization}
Suppose the consumer maximises a quasiconcave utility
function $U(c_1,c_2)$ subject to this budget
constraint. Then the consumer's first-order conditions are
\begin{align}
U_1(c_1,c_2)&=\lambda\\
U_2(c_1,c_2)&=\lambda/(1+r)
\end{align}
where $\lambda$ is the Lagrange multiplier corresponding to
the budget constraint and $U_i(c_1,c_2)$ denotes the partial
derivative $\partial U/\partial c_i$. We have explicitly
shown the dependence of the partial derivatives on the value
of consumption in both periods. These first-order conditions
along with the budget
constraint~\eqref{eq:two-period-budget} together determines
the value of $c_1$, $c_2$ and~$\lambda$.
\subsection{Comparative statics}
Assuming that consumption in both periods is a normal good, an
increase in either $y_1$ or $y_2$ increases both $c_1$ and $c_2$.
The effects of a change in $r$ are ambiguous. An increase in $r$ makes
consumption in period~$2$ relatively cheap compared to consumption in
period~$1$. Therefore the substitution effect causes $c_1$ to decrease
and $c_2$ to increase. It is traditional to decompose the income
effect into two parts. First, an increase in $r$ reduces the present
value of the consumer's endowments and hence decreases his real
income. Second, an increase in $r$, by making the consumption in
period~$2$ cheaper increases his real income.\footnote{
For more about the Slutsky equation in the case of a consumer with
fixed endowments of goods see section~9.1 in Varian's
\emph{Microeconomic Analysis}, 3rd ed.} The sign of the
resultant of these two effects on consumption depends on whether the
consumer is a net lender in period~1 and a net borrower in period~2 or
vice-versa. In case the consumer is a net lender in period~1 and a net
borrower in period~2 the net income effect is positive. Assuming the
consumption in both periods in a normal good, this means that the
substitution effect and the income effect act in opposite directions
on $c_1$ in this case leading to an ambiguous effect.
\section{Many periods}
Assume that rather than just living for two periods the consumer lives
for $T+1$ periods. Further assume that the real rate of interest takes a
constant value $r$ over the consumer's lifetime. For convenience we
define $\delta=1/(1+r)$. It is also convenient to start time from
period~0 rather than period~1.
\subsection{Budget constraint}
Arguing as before, the consumer's budget constraint is
\begin{equation}\label{eq:many-period-budget}
\sum_{i=0}^T \delta^i c_i = \sum_{i=0}^T \delta^i y_i
\end{equation}
\subsection{Utility function}
We could proceed as before by assuming a utility function
$U(c_0,\ldots,c_T)$ and deriving the first order conditions. However,
because the marginal utility in each period depends on consumption in
all periods it is hard to draw any sharp conclusions at this level of
generality. Therefore we need to impose some restrictions on the form
of the utility functions.
Suppose, for example we assume that the utility function is additively
separable, i.e.
\begin{equation}\label{eq:utility-addsep}
U(c_0,\ldots,c_T)=v_0(c_0)+v_1(c_1)+\cdots+v_T(c_T)
\end{equation}
Then the first-order conditions take the form
\begin{equation}\label{eq:foc-additively-separable}
v_i'(c_i) = \delta^i \lambda \qquad i=0,\ldots,T
\end{equation}
where, as before, $\lambda$ is the Lagrange multiplier corresponding
to the budget constraint.
Sometimes we want to restrict the consumers preferences even further,
by assuming that the different $v_i$ differ from each other by only a
geometric discounting factor.
\begin{equation}\label{eq:utility-geometric}
U(c_0,\ldots,c_T)=\sum_{i=0}^T \beta^i u(c_i)
\end{equation}
where $\beta$ is a constant, referred to as the subjective rate of
discount, such that $0<\beta<1$.
In this case the first-order conditions take the particularly simple
form
\begin{equation}\label{eq:foc-stationary}
u'(c_i)=\left(\frac{\delta}{\beta}\right)^i \lambda \qquad i=0,\ldots,T
\end{equation}
In case $\delta=\beta$, this implies that $u'(c_i)$ is the same for
all $i$, which, assuming that $u'(\cdot)$ is a strictly decreasing
function, means that $c_i$ is constant for all $i$. The present
period's income does not influence the present period's consumption at
all. Consumption is determined solely by lifetime resources as given
by~\eqref{eq:many-period-budget}.
The case $\delta \neq \beta$ is also instructive. Suppose
$\delta>\beta$. In this case it follows from~\eqref{eq:foc-stationary}
that consumption decreases over time. Formally, this is because if
$\delta>\beta$ then by~\eqref{eq:foc-stationary} $u'(c_i)$ increases
over time, and since $u'(c)$ is a decreasing function of consumption,
this implies that $c$ decreases over time.
The economic logic behind this result is that $\delta$ is the number of
units of consumption we have to give up at present in order to
purchase one more unit of consumption next period, whereas $\beta$ is
the number of units of marginal utility we are willing to give up at
present in order to have one more unit of marginal utility in the next
period. Suppose we start with the same consumption $c$ in this period and
the next. If we reduce consumption in the next period by a small
amount $\Delta c$ then at the prevailing market prices we can
increase present consumption by $\delta\Delta c$. The increase in
utility from the increase in present consumption is approximately
$u'(c)(\delta\Delta c)$.\footnote{We are using Taylor's theorem:
$u(c+\delta\Delta c)-u(c) \approx u'(c)(\delta\Delta c)$} The decrease in utility from the reduction in
next period's consumption is approximately $\beta u'(c)(\Delta c)$. The
net change in utility would be $(\delta-\beta)u'(c)(\Delta c)$ which
is positive when $\delta>\beta$. Thus it is beneficial to increase present consumption and
reduce future consumption if we are starting from a position of
equality. Indeed, it will be optimal to increase consumption in the
present period (say period $i$) and decrease consumption in the next
period (period $i+1$) till the following equality between the MRS and
the price ratio is satisfied,
\[\frac{u'(c_{i+1})}{u'(c_i)}=\frac{\delta}{\beta}\]
If $\delta/\beta$ is close to $1$ then $c_{i+1}$ is close to $c_i$ and
we can use Taylor's Theorem from calculus to the above equation to the
above equation to get a useful approximation.
\begin{align*}
\frac{u'(c_{i+1})}{u'(c_i)}&=\frac{\delta}{\beta}\\
\frac{u'(c_{i+1})-u'(c_i)}{u'(c_i)}&=\frac{\delta}{\beta}-1\\
\intertext{Applying Taylor's Theorem}
\frac{u''(c_i)(c_{i+1}-c_i)}{u'(c_i)}&\approx\frac{\delta}{\beta}-1\\
\intertext{Defining $\Delta c=c_{i+1}-c_i$, and dropping the subscript
$i$,}
\left(\frac{u''(c)c}{u'(c)}\right)\left(\frac{\Delta
c}{c}\right)
&\approx\frac{\delta}{\beta}-1\\
\intertext{The quantity $\sigma=-u'(c)/cu''(c)$ is known as the
\emph{intertemporal elasticity of substitution} and captures the
sensitivity of marginal utility of changes in consumption. It is positive since marginal utility
decreases with consumption.}
\left(\frac{\Delta
c}{c}\right)&\approx
\sigma\left(1-\frac{\delta}{\beta}\right)
\end{align*}
The formula confirms our earlier reasoning that consumption decreases
over time if $\delta>\beta$. Moreover, it shows that the sensitivity
of the growth of consumption on the rate of return depends
on the intertemporal elasticity of substitution. This is because the
intertemporal elasticity of substitution is the reciprocal of the
elasticity of marginal utility with respect to the level of
consumption. The more elastic is marginal utility to consumption, the smaller
is the deviation in consumption from a constant path that is required
the equate the ratio of marginal utilities in consecutive time periods
to $\delta/\beta$.
\subsection{Exogenous variables}
It is possible to unify~\eqref{eq:utility-addsep}
and~\eqref{eq:utility-geometric} by writing
\[v_i(c_i)=\beta^i u(c_i,\xi_i)\]
where $\xi_i$ is an exogenous variable such a the consumer's age or
the number of members in the household. In this case the first-order
conditions become
\[u'(c_i,\xi_i)=\left(\frac{\delta}{\beta}\right)^i \lambda \qquad
i=0,\ldots,T\]
Knowing how $\xi$ affects the marginal utility would
now let us make some predictions regarding the path of consumption.
\subsection{Comparative statics}
Assuming that consumption in every period is a normal good, an
increase in $y_i$ increases every $c_i$.
The effect of an increase in $r$, or equivalently, a decrease in
$\delta$ remains ambiguous because of the same income and substitution
effects as discussed earlier. But for the utility function given
by~\eqref{eq:utility-geometric}, we can say a little
more. From~\eqref{eq:foc-stationary} we can see that a decrease in
$\delta$ means that the \emph{growth rate} of consumption speeds
up. Remember that even in this case we do not have any information
regarding the \emph{level} of consumption in any period since the
level would depend on $\lambda$ which in turn depends on
$\delta$.
\section*{Exercises}
\begin{exercises}
\item A consumer consumes a single good in two
periods---period~1 and period~2. Let her consumption in the two
periods be denoted by $c_1$ and $c_2$ respectively. The consumer has
an endowment of $e_1$ and $e_2$ units of consumption in the two
periods respectively. The consumer has no other sources of income or
wealth. Assume that the \emph{money} price of the consumption good
in the two periods is $P_1$ and $P_2$ respectively and the nominal
interest rate between the two periods is $i$.
\begin{enumerate}
\item Write down the consumer's budget constraint.
\item Write down an exact (not approximate) formula for the real
interest rate in this setting in terms of $P_1$, $P_2$ and
$i$. (Think of how many additional units of consumption you can
get in period~2 if you give up one unit of consumption in
period~1).
\item Argue that the consumer's budget set depends only on the
real rate of interest, i.e., combinations of changes in prices and
the nominal interest rate which leave the real interest rate
unchanged also leave the consumer's budget set unchanged.
\item Assume that the consumer's utility function is given by:
\[U(c_1,c_2) = c_1c_2\]
Calculate the amount consumed and the amount saved by the consumer
in period~1 as a function of the real interest rate. Sketch rough
graphs of these two functions.
\end{enumerate}
\item Consider a consumer who lives from period $0$ to $T$,
has an initial wealth $w$ and no other sources of income. Suppose
that the consumer has additive separable preferences with the
felicity function,
\[v_i(c_i)=\beta^i c_i^{1-\rho},\quad 0<\rho<1.\]
The consumer can lend and borrow any amount she wishes at the real
rate of interest~$r$.
\begin{enumerate}
\item What is the intertemporal elasticity of substitution
corresponding to this consumer's felicity function?
\item Use the first-order conditions of the conumer's utility
maximization problem (not a linear approximation) to show that the
optimal consumption path chosen by this consumer shows a constant
rate of growth of consumption. Derive an expression for the growth
rate of consumption in terms of $\beta$, $r$ and~$\rho$.
\end{enumerate}
\item Consider a consumer who lives for two periods and must
decide on how much to spend on a durable good in each of the two
periods. The consumer's utility function is given by
\[U(x_1,x_2)=[x_1^{1-\rho}+x_2^{1-\rho}]/(1-\rho),\qquad \rho>0\]
where $x_1$ and $x_2$ is the stock of durable goods held by the
consumer in the two periods.
The stocks are related to the
consumer's expediture $c_1$ and $c_2$ in the two periods by
\begin{align*}
x_1&=c_1\\
x_2&=\gamma x_1+c_2,\qquad{0<\gamma<1}
\end{align*}
where $(1-\gamma)$ is the rate at which the stock of the
durable depreciates.
The consumer has an initial wealth $w$ and no other source
of income. She is free to lend and borrow at the interest
rate $r$.
Under what conditions on $\gamma$, $R$ and $\rho$ will the consumer
not spend anything in the second period? Give an economic
interpretation for your result.
\item A and B are two agents who derive satisfaction from
the consumption of leisure and apples over a number of
periods. For A leisure and apples are substitutes whereas
for B leisure and apples are complements. Suppose an
exogenous shock reduces the leisure available to both in a
given period without affecting their incomes. How will the
consumption of apples in that period change for each?
\item Consider a consumer who lives for two periods. The
consumer has a real earning of $y_1$ and $y_2$ in the two
periods respectively and must choose his level of real
consumption $c_1$ and $c_2$ in the two periods.
The consumer can lend and borrow at the real rate of
interest~$r$. However, the consumer cannot borrow more
than a fraction $\theta$ ($0<\theta<1$) of the present value of his second period
earnings. That is, if $b_1$ is the amount borrowed by the
consumer in the first period then it must be the case that
\[b_1 \leq \frac{\theta y_2}{1+r}.\] There are no
restrictions on the amount that the consumer can lend.
\begin{enumerate}
\item Sketch the consumer's budget set in the $c_1$, $c_2$ plane.
\item Suppose that the consumer's utility function is given by
\[U(c_1,c_2)=\ln c_1+\ln c_2.\] Calculate the
consumer's first-period consumption demand ($c_1$) as
a function of $y_1$, $y_2$, $\theta$ and~$r$.
[Hint: Take the possibility of a corner solution into account.]
\item Calculate this consumer's first-period marginal
propensity to consume $\partial c_1/\partial y_1$. How
does this marginal propensity to consume change with
changes in $y_1$? Explain your answer in economic
terms.
\end{enumerate}
\item Consider the following example of a two-period utility
function with habit formation:
\[U(c_1,c_2)=\frac{1}{1-\rho}[c_1^{1-\rho}+(c_2-\gamma
c_1)^{1-\rho}],\qquad \gamma>0,\rho>0\]
\begin{enumerate}
\item Suppose a consumer with these preferences has an
initial wealth $w$, no other sources of income and can
freely lend and borrow at the interest rate~$r>0$.
What will be the consumer's optimal choice of $c_1$
and $c_2$?
\item An economist wishes to use observed data on $c_1$,
$c_2$ and~$r$ to estimate the parameter $\rho$. But
the economist mistakenly assumes that there is no
habit formation. That is, the economist mistakenly
assumes that the consumer's preferences are
\[U(c_1,c_2)=\frac{1}{1-\rho}[c_1^{1-\rho}+c_2^{1-\rho}],\qquad
\rho>0.\]
Will this economist's estimate of $\rho$ be higher or
lower than the true value?
\end{enumerate}
\end{exercises}
\chapter{The Envelope Theorem}
\section{Parametrised optimisation problems}
Let's think of unconstrained problems first. Every optimisation
problem has an objective function. It is the function that we are
trying to maximise or minimise (henceforth maximise). Some of the variables entering the
objective function are \emph{choice variables}, variables whose values
we are free to choose in order to maximise the objective function. But
all the variables entering into the objective function need not be
choice variables. The value of the objective function may also depend
on the value of other variables which we are not free to choose. We
call these the \emph{parameters} of the optimisation problem.
\begin{exmp}\label{exmp:opti:profit}
Consider the short-term profit
maximising problem of a firm that produces according
to the production function
\[y=f(L,K)=L^{1/2}K^{1/2}\]
In the short-run the capital stock of the firm is fixed at some value
$\bar K$ and the firm can only choose the labour input $L$. If the
firms buys labour and capital in perfectly competitive labour market at prices
$w$ and $r$ respectively and sells its output in a perfectly competitive market at the
price $p$ then its profits are:
\[\pi(L,\bar K)=py-wL-r\bar K=pL^{1/2}{\bar K}^{1/2}-wL-r\bar K\]
For the short-run profit maximising problem $\pi(L,\bar K)$ is the
objective function, with $L$ as a choice variable and $\bar K$ as a
parameter.\footnote{In fact $p$, $r$ and $w$ are also parameters in
the profit function. But we shall ignore this fact for now since we
will not be looking at the effects of changes in these variables.}
Denoting the optimal amount of labour input by $L^*$, the first-order condition for profit maximisation is,
\begin{align*}
\frac{\partial \pi}{\partial L}&=0\\
\frac{1}{2}p{L^*}^{-1/2}{\bar K}^{1/2}-w&=0\\
L^*&={\bar K}(p/2w)^2
\end{align*}
You should check that $\pi(L,\bar K)$ is a concave function of $L$ and
therefore the first-order condition is sufficient to give us a global
maximum. The profit earned by the firm at the optimal point is,
\begin{align*}
\pi^*&=\pi(L^*,\bar K)\\
&=p[{\bar K}^{1/2}(p/2w)]{\bar K}^{1/2}-w[{\bar K}(p/2w)^2]-r\bar K\\
&={\bar K}(p^2/2w)-{\bar K}(p^2/4w)-r\bar K\\
&={\bar K}(p^2/4w)-r\bar K
\end{align*}
We see that both the amount of labour input chosen by the firm and the
maximum profit it earns are functions of the value of the parameter
$\bar K$. The function mapping the parameter values to the maximum (or
minimum) value of the objective function is called the \emph{value
function}. In this case, denoting the value function by $V(\cdot)$
we have,
\[V(\bar K)=\pi^*={\bar K}(p^2/4w)-r\bar K\]
\qed
\end{exmp}
\section{The envelope theorem}
How does the optimal value change when we change the parameters? In
our example since we have an explicit formula for the value function
we can calculate its value directly
\[V'(\bar K) =(p^2/4w)-r\bar K\]
Even when we do not have an explicit formula for the value function,
there is an interesting relationship between the partial derivatives
of the objective function and the partial derivatives of the value
function.
Consider the general problem of maximising the objective function
\[\phi(x_1,\ldots,x_n;c_1,\cdots,c_m)\]
where the $x_i$ are choice variables and $c_i$ are parameters.
The first order conditions for the problem are,
\begin{equation}\label{eq:opti:foc}
\frac{\partial \phi}{\partial
x_i}(x_1,\ldots,x_n;c_1,\ldots,c_m)=0\qquad i=1,\cdots,n
\end{equation}
Just as in the example, the optimal values of the choice variables,
denoted by $x_i^*$, will be functions of the parameters
$c_1,\ldots,c_m$. The value function will be given by
\[V(c_1,\ldots,c_m)=\phi(x_1^*,\ldots,x_n^*;c_1,\ldots,c_m)\]
Suppose we want to calculate the partial derivative of the value
function with respect to one of the parameters, say $c_j$. In doing so
we have to take into account the fact that the optimal value of each
of the choice variables would also be a function of $c_i$.
If we assume that the mapping from the $c_i$ to the optimal
values of the choice variables is differentiable, we can use the
chain rule,
\[\frac{\partial V}{\partial c_j}=
\frac{\partial \phi}{\partial x_1}\frac{\partial x_1^*}{\partial c_j}
+\cdots
+\frac{\partial \phi}{\partial x_n}\frac{\partial x_n^*}{\partial c_j}
+\frac{\partial \phi}{\partial c_j}
\]
However, from~\eqref{eq:opti:foc}, we know that $\partial
\phi/\partial x_i$ is $0$ for all $i$ when the partial derivatives are
evaluated at the optimal values. So we have,
\begin{equation}\label{eq:opti:envelope}
\frac{\partial V}{\partial c_j}=\frac{\partial \phi}{\partial c_j}
\end{equation}
This remarkably is the same result that we would have got if we had
treated each of the $x_i^*$ as a constant. But that would not have
been justified since the choice variables do vary when parameters are
varied. That is, $\partial x_i^*/\partial c_j$ is generally not zero.
It is just that when we are starting from an optimal point then the
marginal impact on this variation on the objective function (i.e.,
$\partial \phi/\partial x_i$ is zero and therefore we can ignore the
changes in the choice variables.
Equation~\eqref{eq:opti:envelope} is known as the ``Envelope
Theorem''.
\section{Geometric Interpretation}
\begin{figure}
\includegraphics[width=0.9\textwidth]{envelope.pdf}
\caption{The Envelope Theorem}\label{fig:opti:envelope}
\end{figure}
Figure~\ref{fig:opti:envelope} illustrates the envelope theorem in the
case of Example~\ref{exmp:opti:profit}. Each of the coloured curves
shows the level of profit for a given level of $L$ and for different
values of $K$. Let's call them ``profit curves''.\footnote{This is not
standard terminology and you must
remember that these curves are not graphs of the full profit function since we
are holding $L$ constant on each of them.} We have drawn only three of
these curves but you should imagine there to be one curve for each
possible value of $L$. Now, since our purpose is to maximise profit
for a given value of $K$, we move along a vertical line for our
particular value of $K$ and choose that $L$ whose profit curve is the
highest at that value of $K$.
Thus, for example, at $K=4.0$ we would choose $L=4.0$ whereas at
$K=10.0$ we would choose $L=2.5$.
The value of the highest profit curve for a given $K$ gives us the
highest profit we can obtain when $K$ takes on that value. But that is
precisely the definition of the value function. Therefore the graph of
the value function touches the highest of the profit curves at each
$K$. Or, in other words, the graph of the value function (the black
line in the figure) must be the upper
envelope of the graphs of the profit functions for given values of
$L$.
Since the value function is the upper envelope of the profit curves,
no profit curve can ever cross it. But at each value of $K$ one of the
profit curves, corresponding to the optimal $L$, touches it. The only
way two graphs can touch without crossing is if they are tangent to
each other. The slope of the graph of the value function is $\partial
V/\partial K$ whereas the slope of the profit curves is $\partial
\pi/\partial K$. Tangency of the two graphs implies that these slopes
should be equal, which is precisely what our the envelope theorem in
eq.~\eqref{eq:opti:envelope} also says when applied to this example.
Now you know what the envelope theorem is called by that name.
\section{Constrained Optimisation}
So far we have discussed unconstrained problems. There is also a
version of the envelope theorem for constrained optimisation problems.
Suppose our problem is to maximise
\begin{equation*}
\phi(x_1,\ldots,x_n;c_1,\ldots,c_m)\\
\end{equation*}
subject to the constraint
\begin{equation}\label{eq:opti:constraint}
h(x_1,\ldots,x_n;c_,\ldots,c_m)=0
\end{equation}
Here we have allowed both the objective function and the constraint to
depend on a set of parameters.
The first-order condition for this problem is
\begin{equation}\label{eq:opti:cfoc}
\frac{\partial \phi}{\partial x_i}
=\lambda \frac{\partial h}{\partial x_i}
\qquad i=1,\ldots n
\end{equation}
where $\lambda$ is a Lagrange multiplier.
As before, if the problem has a solution the optimal values
of the choice variables, the $x_i^*$, will be functions of
the parameters of the problem. Once again we look at the
case where this mapping is differentiable.\footnote{For
sufficient conditions that this be so
see~\cite[Section~11.7]{luenberger-lin-nonlin}.} Also as
before, we can define the value function as
\[V(c_1,\ldots,c_m)=\phi(x_1^*,\ldots,x_n^*;c_1,\ldots,c_m)\]
Differentiating the value function with respect to $c_j$ gives us,
\begin{equation}\label{eq:opti:value-chain}
\frac{\partial V}{\partial c_j}=
\frac{\partial \phi}{\partial x_1}\frac{\partial x^*_1}{\partial c_j}
+\cdots
+\frac{\partial \phi}{\partial x_n}\frac{\partial x^*_n}{\partial c_j}
+\frac{\partial \phi}{\partial c_j}
\end{equation}
To simplify this we need to digress a bit. The optimal values of the
choice variables must satisfy the
constraint~\eqref{eq:opti:constraint} for all values of the parameters, so we have
\[h(x_1^*,\ldots,x_n^*;c_,\ldots,c_m)=0.\]
Differentiating this with respect to
$c_j$ we get
\[\frac{\partial h}{\partial x_1}\frac{\partial x^*_1}{\partial c_j}