NBER WORKING PAPER SERIES EXCHANGE RATE DYNAMICS, LEARNING AND MISPERCEPTION Pierre-Olivier Gourinchas Aaron Tornell Working Paper 9391 http://www.nber.org/papers/w9391 NATIONAL BUREAU OF ECONOMIC RESEARCH 1050 Massachusetts Avenue Cambridge, MA 02138 December 2002 Comments welcome. This paper supersedes our previous paper, “Exchange Rate Dynamics and Learnings", NBER WP 5530. We thank Daron Acemoglu, Olivier Blanchard, Ricardo Caballero, John Campbell, Rudi Dornbusch, Charles Engel, Jeff Frankel, Ben Friedman, Graciela Kaminski, John Leahy, Paul O'Connell, Julio Rotemberg, Nouriel Roubini and seminar participants at UCLA for useful discussions and comments. All errors are our own. The views expressed herein are those of the authors and not necessarily those of the National Bureau of Economic Research. © 2002 by Pierre-Olivier Gourinchas and Aaron Tornell. All rights reserved. Short sections of text not to exceed two paragraphs, may be quoted without explicit permission provided that full credit including, © notice, is given to the source. Exchange Rate Dynamics, Learning and Misperception Pierre-Olivier Gourinchas and Aaron Tornell NBER Working Paper No. 9391 December 2002 JEL No. E4, F31, G1 ABSTRACT We propose a new explanation for the forward-premium and the delayed-overshooting puz- zles. Both puzzles arise from a systematic under-reaction of short-term interest rate forecasts to current innovations. Accordingly, the forward premium is always a biased predictor of future depreciation; the bias can be so severe as to lead to negative coeffcients in the “Fama" regression; delayed overshooting may or may not occur depending upon the persistence of interest rate innovations and the degree of under-reaction; lastly, for G-7 countries against the U.S., these puzzles can be rationalized for values of the model's parameters that match empirical estimates. Pierre-Olivier Gourinchas Aaron Tornell Princeton University Department of Economics 311 Fisher Hall UCLA Economics Department 405 Hilgard Avenue, Bunche Hall #8283 Princeton, NJ 08544 Los Angeles, CA 90095-1477 and NBER and NBER

[email protected] [email protected]

Exchange Rate Dynamics, Learning and Misperception Abstract. We propose a new explanation for the forward-premium and the delayed-overshooting puz- zles. Both puzzles arise from a systematic under-reaction of short-term interest rate forecasts to current innovations. Accordingly, the forward premium is always a biased predictor of future depreciation; the bias can be so severe as to lead to negative coe±cients in the \Fama" regres- sion; delayed overshooting may or may not occur depending upon the persistence of interest rate innovations and the degree of under-reaction; lastly, for G-7 countries against the U.S., these puzzles can be rationalized for values of the model's parameters that match empirical estimates. J.E.L. Classi¯cation: E4, F31, G1. This paper proposes a new explanation for the forward premium and delayed overshooting puzzles by demonstrating empirically how both puzzles can arise from a systematic under-reaction of short term interest rate forecasts to innovations in current interest rates. Over the past twenty years, a large body of empirical literature has documented the existence of large biases in forward premia for predicting future changes in exchange rates.1 This Forward Premium Puzzle (FPP) implies typically that di®erentials between the domestic and foreign nominal interest rates bear little predictive power for the future rate of change in spot rates. If anything, forward rates and expected depreciation tend to move in opposite directions: a positive interest rate di®erential is more often than not associated with a subsequent appreciation of the exchange rate, not the depreciation that theory predicts. This empirical regularity implies signi¯cant predictable excess returns in currency markets. A lesser known puzzle, the delayed overshooting puzzle, has been uncovered more recently by Eichenbaum and Evans (1995). These authors ¯nd that unanticipated contractionary shocks to U.S. monetary policy are followed by (a) a persistent increase in U.S. interest rates, and (b) a gradual appreciation of the dollar, followed by a gradual depreciation several months later. This `delayed overshooting' pattern is consistent with predictable excess returns and the forward premium puzzle: 1 for a while, U.S. interest rates are higher than their foreign counterparts, with an associated forward discount, yet the dollar appreciates, yielding positive excess returns. This dynamic pattern is also in contradiction with Dornbusch (1976)'s overshooting result, whereby the exchange rate should experience an immediate appreciation, and then depreciate gradually towards its new long run equilibrium value. The existence of predictable excess returns, implied by these two puzzles, re°ects time-varying currency risk premia and/or expectational errors. The evidence we describe below indicates that the former channel is not su±cient to explain the highly volatile predictable excess returns implied by the puzzles. In fact Frankel and Froot (1989) ¯nd that expectational errors in foreign exchange markets play a key role. In this paper we investigate whether interest rate forecast errors observed in the data can ratio- nalize simultaneously these two exchange rate puzzles. Our starting point is a setup where agents constantly learn about the duration of interest rate shocks: transitory versus persistent.2 In and by itself, the fact that agents constantly learn about the duration of interest rate innovations is not enough to rationalize the two exchange rate puzzles. Since rational agents cannot be system- atically fooled, there cannot exist predictable excess returns, which are necessary for the delayed overshooting and the forward premium puzzles. However, using a unique survey data set on interest rate expectations, published by the Finan- cial Times Currency Forecaster, with monthly observations from 1986 to 1995 for all G-7 countries, we ¯nd substantial evidence of under-reaction to interest rate innovations. More speci¯cally, we ¯nd (a) no evidence of transitory shocks in the forward premium; (b) yet implicitly market par- ticipants expect incorrectly a substantial share of interest rate innovations to be purely transitory. This contrast is striking: the relative variance of transitory shocks implicitly assumed by market participants is often signi¯cantly larger than one, indicating that most innovations are perceived as transitory. These results mirror those of Campbell and Shiller (1991). We will not take a stand on the origins of the forecasts' short-run under-reaction. What we do is develop a uni¯ed model that allows us to link interest rate forecast to the two puzzles alluded to above. The only deviation from full rationality in the model is the misperception about second moments implicit in interest rate forecasts. Given these forecasts, the exchange rate is determined 2 by the standard no arbitrage condition. We demonstrate that for typical values of the misperception, the equilibrium exchange rate in the model exhibits both delayed overshooting and the forward premium puzzle in its most extreme form {i.e., a negative Fama coe±cient. To gain some intuition for this result, consider the following experiment. Suppose that do- mestic interest rates increase vis-a-vis constant foreign short rates, then return gradually to their equilibrium value. If agents know the exact nature of the shock, the exchange rate immediately appreciates relative to its long run value up to the point where the expected future depreciation compensates for the interest rate di®erential. It then progressively reverts to equilibrium as the in- terest rate di®erential declines. This is the interest rate e®ect: there is overshooting and uncovered interest parity holds. Suppose now that agents misperceive the shock as transitory. On impact, agents believe the domestic interest rate will revert to its equilibrium value fairly rapidly. The exchange rate may initially appreciate moderately. In the next period, the interest rate is in fact higher than agents expected, leading to an upward revision in beliefs regarding the persistence of the shocks. This revision leads to an appreciation. This is the learning e®ect. However, the domes- tic interest rate is also reverting to its equilibrium value, leading to a depreciation. If the learning e®ect is strong enough so as to dominate the interest rate e®ect, there is a gradual appreciation of the currency. Eventually, there is not much more to learn and the interest rate e®ect dominates. Thus, the exchange rate reverts to its equilibrium value. Along this path, there are positive excess returns in the domestic currency, and the forward premium is negatively correlated with expected appreciation. The previous intuition describing the conditional response to an interest rate innovation carries over to an unconditional statement about the forward premium. While we show that the forward premium bias always arises as soon as there is misperception, the most extreme form of this puzzle (a negative Fama coe±cient), as well as delayed overshooting, depend upon the parameters of the model. Intuitively then, hump-shaped exchange rate dynamics result from the interaction of the mis- perception about the relative importance of interest rate shocks (transitory vs. persistent) and the gradual mean reversion of interest rates. Note that whether there exists delayed overshooting and a negative Fama coe±cient in our model depends on misperceptions about second moments of the 3 interest rate process. Misperceptions about ¯rst moments play no role. Note also that in our model predictable excess returns vanish at long horizons, and the Fama coe±cient converges to one. This is in line with the empirical evidence. To sum up, we propose a model that links under-reaction of interest rate forecasts to the forward discount and the delayed overshooting puzzles. We are successful along a number of dimensions: (i) using a unique survey of interest-rate forecasts for the G-7 countries, we demonstrate empirically a systematic under-reaction of forecasts to short term changes in interest rates;3 (ii) according to our model, the forward premium is almost always a biased predictor of future changes in the spot rate; (iii) moreover, the model can accommodate the most extreme forms of the bias (negative Fama coe±cients) unlike most of the previous literature; (iv) depending upon the parameters of the model, we may or may not obtain a delayed overshooting response of nominal exchange rates to monetary shocks. This is empirically satisfying since delayed overshooting seems much less prevalent and robust than the forward premium puzzle. The next subsection describes the related literature. Section 1 presents a simple version of the model. Section 2 documents the empirical evidence on the expectational and term structure components of the forward discount, reproducing results from Frankel and Froot (1989). We also present evidence on the systematic under-reaction of short rate forecasts. Section 3 presents the model. Section 4 concludes. Most proofs are included in the appendix. 0.1. Related Literature. Delayed overshooting and the forward premium puzzle are both statements about predictable excess returns. Yet they di®er in subtle ways. The former is a statement about the joint conditional response of nominal interest rates and exchange rates to a common unanticipated monetary innovation. The later is an unconditional statement. Empirically, the forward premium puzzle seems much more prevalent, albeit not always in its most extreme form.4 In an accounting sense, there are two possible explanations for the forward premium puzzle: time-varying risk premium and/or expectational errors. To see this, start from the standard log- linearized arbitrage condition: (1) Etm et+1 ¡ et = rt ¡ rt¤ ¡ ³ t 4 where et is the log of the domestic price of the foreign currency, rt and rt¤ are respectively the domestic and foreign one-period nominal interest rate, and ³ t is a domestic currency risk premium. Here, Etm et+1 represents the market expectation of next period exchange rate, which may di®er from statistical or rational expectations, denoted by Et et+1 : According to equation (1), the return on the short domestic bond, rt ; is equal to the return on a foreign bond of the same maturity, rt¤ ; adjusted for the market's expectations of depreciation Etm et+1 ¡ et ; as well as a risk premium component ³ t : Of course, as it stands this arbitrage relationship has no empirical power since both market expectations and the risk premium are unobservable. Empirical tests make two additional assump- tions: (a) expectations are rational in the sense that Etm et+1 = Et et+1 ; (b) the risk premium ³ t is constant or uncorrelated with the forward premium rt ¡ rt¤ . Tests of (1) under these assumptions fare quite badly (Fama (1984)): regressions of the form et+1 ¡ et = ®+¯ (rt ¡ rt¤ )+ ut+1 (the `Fama regression') typically ¯nd a ¯ signi¯cantly smaller than 1, and often negative. Table 1 reports the results from such a regression.5 The results are typical of the literature: at short horizons, the Fama coe±cient ¯ is often signi¯cantly negative. At longer horizons, ¯ is often not statistically di®erent from 1: This `forward premium puzzle' or `Fama puzzle' implies time-varying predictable excess returns de¯ned as: (2) » t = (rt ¡ rt¤ ) ¡ (Et et+1 ¡ et ) = (Etm et+1 ¡ Et et+1 ) + ³ t According to equation (2), predictable excess returns result from expectational errors and/or a currency risk premium. The time-varying risk premium school of thought argues that expectations are rational, markets e±cient, and °uctuations in the forward discount re°ect changes in underlying risks. As Fama (1984) points out, the typical estimated bias in the forward premium implies that the currency risk premium ³ t must be more volatile than predictable excess returns » t . In equilibrium models, the currency risk premium °uctuates with relative asset supplies, conditional variances, and risk aversion. It is di±cult to reconcile the low volatility of the above-mentioned variables with the high volatility of predictable excess returns, unless one invokes unrealistically high risk aversion coe±cients.6 A more recent line of research, using a±ne models, characterizes directly restrictions on the 5 time-series properties of pricing kernels and the underlying risk factors consistent with the forward premium anomaly (see Backus, Foresi and Telmer (1998) and Sa'a-Requejo (1994)). While of great interest to the practitioner, it is di±cult to establish equilibrium foundations for the implied pricing kernels. Maybe the best empirical evidence is provided by Frankel and Froot (1989). Using survey data on exchange rate forecasts, Frankel and Froot decomposed predictable excess returns into their currency risk premium and expectational error components. Their results indicate clearly that (a) at short horizons, almost none of the forward premium bias can be attributed to currency risk premium °uctuations and (b) changes in the forward premium re°ect almost one for one changes in expected appreciation/depreciation. Table 2 reproduces their analysis over the period 1986- 1995, with substantially similar results.7 The direct conclusion is that expectational errors must be responsible for a large part of the forward premium puzzle. Learning about a one-time change in regime has been analyzed by Lewis (1989a) and (1989b). In that model, following a change in regime, agents gradually update their beliefs about the current state of the world, generating systematic forecast errors during the transition. These learning models can explain some part of the exchange rate mispredictions implied by the forward premium bias, although not the more extreme form where expected depreciation and forward premium move in opposite directions. In general, models of learning about infrequent regime shifts have a di±cult time matching the size of the bias and do not account for the fact that predictable excess returns do not appear to die out over time between infrequent regime switches.8 Further, since regime shifts generate forecast errors that die out over time, models based on learning about a one-time change in regime cannot be expected to deliver neither hump-shaped exchange rate dynamics, nor negative coe±cients in the Fama regression. There is an interesting parallel between our results and well established empirical regularities on asset returns. A large volume of empirical work has documented various ways in which asset returns -especially stock returns- are partly predictable based on publicly available information. Of particular interest to us, numerous studies have documented that asset prices under-react to news in the short run. Cutler, Poterba and Summers (1991) show that aggregate indices tend to be positively serially correlated at short horizons, while Jegadeesh and Titman (1993) and Fama 6 and French (1992) present similar evidence on cross-sections of individual stocks. Stock returns also under-react to announcements of public information (such as earnings, see Bernard (1992), or Chan, Jegadeesh and Lakonishok (1995)).9 Thus under-reaction to publicly available news seem to be a prevalent fact. Paralleling research in International Finance, the consensus in Finance is that the short term under-reaction cannot be explained in terms of risk. Instead, recent models in Behavioral Finance attempt to rationalize asset price puzzles in terms of optimal strategies of boundedly rational agents.10 Unlike these behavioral models, we will not take a stand on the origins of the under- reaction. We leave it for future research to investigate the conditions under which such behavior might arise in equilibrium. In particular, we do not rule out that measured expectations re°ect statistical expectations conditioned upon a subset of the publicly available information, although we have some doubts as to whether this is a valid research strategy. 1. A Simple Case In this section we present a simpli¯ed version of the model of Section 3. Suppose that agents are risk neutral (³ t = 0) so that all predictable excess returns arise from expectational errors. Iterating equation (2) forward, we can express predictable excess returns as function of the sequence of forward premium (xt = rt ¡ rt¤ ) expectational errors: X1 £ ¤ »t = ¡ Etm xt+j ¡ Et Et+1 m xt+j + (¹ et ¡ Et e¹t+1 ) j=1 where e¹t = limT !1 Etm et+T de¯nes the long run equilibrium value of the exchange rate that satis¯es Etm e¹t+1 = e¹t . This makes clear that in the absence of currency risk premium (³ t = 0) ; the forward premium puzzle can be rationalized only if market expectations of interest rate di®erentials xt di®er from their statistical expectations, at least at some horizon, or if forecasts of the long run equilibrium exchange rate are incorrect. In this paper we focus on interest rate forecast errors, as demonstrated by the existence of short term under-reaction of forecasts of interest rate di®erentials between the US and the other G7 countries. We abstract from long run forecast errors by assuming Et e¹t+1 = e¹t .11 7 To see whether this forecast under-reaction can explain delayed overshooting and the more extreme form of the forward premium puzzle (a negative coe±cient in the Fama regression), let us represent our empirical ¯ndings in the following way. Suppose that the forward premium follows an AR process with autocorrelation ¸: (3) xt = zt = ¸zt¡1 + ²t However, assume that agents perceive instead that the forward premium is given by: (4) xt = zt + vt where the shocks ²t and vt are i.i.d. normal variables with mean zero and respective variances ¾2² and ¾ 2v : That is, agents erroneously perceive that, in addition to the persistent component xt , the forward premium also contains a transitory component vt . Given their perceptions, agents form their forecasts and choose their portfolio optimally. Taking the forward premium process (3)-(4) as given, agents solve a standard signal extraction problem using Bayes law. As we show in Section 3, the agents' forecast of the forward premium Etm xt+1 is given by: (5) Etm xt+1 = (1 ¡ k) ¸ Et¡1 m xt + k¸ xt ¸2 ¾2 +¾ 2² The weight given to current observations relative to past expectations is k = ¸2 ¾ 2 +¾2² +¾2v ; where ¾2 is the variance of the estimate Etm xt+1 .12 Note that when ¾2v = 0; we have that k = 1 and market expectations Etm xt+1 correspond to statistical expectations. In contrast, when agents misperceive shocks to be more transitory than what they actually are (¾2v > 0), we have that k < 1: Given agents' expectations (5), the no-arbitrage condition (1) implies that the exchange rate satis¯es xt 1 (6) et = e¹t ¡ + [Et xt+1 ¡ Etm xt+1 ] 1¡¸ 1¡¸ where the stochastic trend e¹t satis¯es e¹t = Et e¹t+1 : It follows from (6) that there are positive predictable excess returns (» t > 0) when the expected forward premium falls short of its statistical expectation (Etm xt+1 < Et xt+1 ); i.e. when short term interest rate forecasts systematically under- react 8 µ ¶ ¸k »t = 1 + [Et xt+1 ¡ Etm xt+1 ] 1¡¸ Notice that under rational expectations agents know that there is no transitory component in the xt forward premium (i.e., k = 1). Thus, the exchange rate follows: et = e¹t ¡ 1¡¸ and there are no predictable excess returns. The response at time t + j to an interest rate innovation " > 0 at time t is: ¸j " h i (7) et+j = e¹t ¡ 1 ¡ ¸(1 ¡ k)j+1 1¡¸ " » t+j = ¸j+1 (1 ¡ k)j+1 ¸ 0 1¡¸ From this, we see that (a) there are positive predictable excess returns when agents misperceive shocks to be more transitory than what they actually are (k < 1); (b) delayed overshooting at horizon ¿ (i.e., jet+¿ ¡ Et e¹t+¿ j > jet+¿ ¡1 ¡ Et e¹t+¿ ¡1 j) occurs if and only if: µ ¶ (1 ¡ ¸) =¸ (8) ¿ < ln = ln (1 ¡ k) ; 1 ¡ ¸ (1 ¡ k) and (c) the probability limit of ¯ in the Fama regression is always smaller than 1 and may even be negative: (1 + ¸) (1 ¡ ¸ (1 ¡ k)) ¸ (1 ¡ k) p lim ¯ = 1 ¡ ·1 1 ¡ (1 ¡ k) ¸2 Notice that there can be positive predictable excess returns on the domestic currency (» t+j ¸ 0), even if there is no delayed overshooting. Figure 1 shows the path of the exchange rate in response to an unanticipated decrease in the interest rate at t = 0: The exchange rate depreciates for about 10 periods before reverting back to its long run value. If one interprets each period as a week or a month, this graph resembles the impulse response functions estimated by Clarida and Gali (1994), Eichenbaum and Evans (1995) and Grilli and Roubini (1994). The duration of each period should depend on the frequency with which one believes that investors receive \new and relevant" information. We now describe the intuition behind these results. There are two e®ects: 9 ² Interest rate e®ect. After an initial downward jump, domestic interest rates follow an increasing path. This induces the exchange rate to experience an immediate depreciation followed by a gradual appreciation to ensure that uncovered interest parity holds. This e®ect is captured by the term in ¸j in (7): the speed at which the initial disturbance fades away. ² Learning e®ect. When the shock takes place at time t agents only observe a reduction in yt and gradually lower their belief about xt+j using updating equation (5). A downward revision m x of Et+j t+j+1 generates depreciating pressures on the exchange rate. This e®ect is captured by the term in (1 ¡ k)j+1 in (7). When there is no misperception (k = 1), this term vanishes. According to (8), a smaller ¸ (less persistence) increases the second term proportionally more than the ¯rst one, making delayed overshooting less likely. This means that an economy converging more rapidly to its long run equilibrium is less likely to exhibit delayed overshooting. The quicker convergence occurs, the more persistent shocks look like transitory ones. Thus, little weight is given to past observations, weakening the learning e®ect. Changes in k (the degree of misperception) have more complex e®ects. For a su±ciently large k, the learning process works e±ciently and beliefs converge to the true value of the persistent component of the interest rate very rapidly. As a consequence the subsequent upward revision of beliefs is very small. Therefore, the learning e®ect is dominated by the interest rate e®ect and there is no delayed overshooting. In other words, since beliefs have almost converged at time 0, market participants bid the exchange rate up until it is back on the full information rational expectations path. For su±ciently small k, learning occurs very slowly and interest rates convey little information about their persistent component. Thus, the market forecast Etm xt+1 increases very little at the time of the shock. Afterwards, although Etm xt+1 is updated upwards, the learning e®ect is too small to dominate the interest rate e®ect. We now turn to the issue of the length of time over which the exchange rate moves in the \wrong" direction. Condition (8) de¯nes a `delayed overshooting' region at horizon ¿ ; D¿ ; in terms of the parameters ¸ and k:13 Figure 2 reports the lower boundary of this delayed overshooting region as we vary ¿ from 1 to 10 periods. As we increase the peak date ¿ , the conditions on ¸ and k become more stringent: the frontier of D¿ shifts up, as seen in Figure 2. As expected, delayed 10 overshooting obtains for high persistence and low, but not too low, values of k: Figure 3 reports the contour plot of the coe±cient from the Fama regression as a function of ¸ and k: The Fama coe±cient can be negative for small (but not too small) values of k and large values of ¸: The contour plot reveals that the region for negative Fama coe±cients coincides roughly with the region for delayed overshooting. Note that the empirical evidence presented in Table 1 shows that the Fama coe±cients are increasing with the horizon. In fact, they are close to one for a number of currencies at horizons of 12 months. This indicates that predictable excess returns disappear as the horizon increases. Figure 4 shows that our model is consistent with these facts. It depicts the limit of the Fama coe±cient ¯ n as a function of the regression horizon n : et+n ¡ et = ® + ¯ n n rtn + ²t+n where rtn is the continuously compounded yield on a n ¡ period zero coupon bond. This ¯gure makes clear that in our model economy the Fama coe±cient can be substantially negative at short horizons, but that it eventually converges to one. To sum up, our analysis has strong cross-sectional implications. Countries should exhibit un- conditional delayed overshooting and the forward discount puzzle in its most extreme form (i.e., a negative Fama coe±cient), if (a) monetary shocks have high conditional persistence (¸), resulting, for instance, from a low interest elasticity of money demand, and (b) the degree of misperception (1 ¡ k) is high, but not too high. Further, our analysis indicates that there are always positive predictable excess returns at short horizons (» t > 0), even if there is no delayed overshooting. 2. An Empirical Exploration of Interest Rate Forecasts The previous section highlighted the key insight of the model: delayed overshooting and the forward premium puzzle can arise in equilibrium from a systematic under-reaction of interest rate forecasts. In this section, we demonstrate that this under-reaction is present in the data. More speci¯cally, using a state space model that provides the foundation for equation (5), we establish that interest rate forecasts systematically under-react to interest rate innovations. 11 2.1. Modelling the Interest Rate Di®erential. To characterize the interest rate process, we adopt the following state-space representation. The interest rate di®erential between any two countries (xt ) consists of a persistent (zt ) and a transitory (vt ) components, as well as a constant ¹:14 (9) xt = ¹ + zt + º t In addition, we assume that the persistent component, zt ; follows an AR(q) process: (10) ¸ (L) zt = ²t Pq i with ¸ (L) = 1¡ i=1 ¸i L . The transitory and persistent innovations are independent and normally distributed with mean 0 and variance ¾2º and ¾2 respectively. For future reference, we de¯ne the ¾2º 15 noise to signal ratio ´ = ¾2 . One possible justi¯cation for our interest rate representation lies in its °exibility: depending on the underlying parameters, this representation can accommodate an integrated process -where some of the roots of ¸ lie on the unit circle- as well as a white noise.16 A more structural interpretation is also possible. Following Dornbusch (1976), we can interpret the transitory shock º t as a relative velocity shock, and the persistent shock ²t as a permanent relative money supply shock. In the presence of sticky prices in the short run, a permanent reduction (increase) in the nominal money stock leads to an increase (reduction) in the domestic interest rate. As prices adjust slowly over time, real money supply increases and the interest rate declines gradually until it reaches its steady state value. This interpretation is consistent with the empirical ¯ndings of Eichenbaum and Evans (1995): an exogenous shock to the US money supply induces a persistent change in the US interest rate in the opposite direction.17 Lastly, we want to emphasize that º t and ²t can capture the uncertainty surrounding the conduct of monetary policy. Both the monetary policy target and the information set upon which Central Banks act are imperfectly known to the market.18 Thus, transitory shocks may arise when the Fed acts on inaccurate forecasts or to re°ect balance of power adjustments among the Open Market Committee members. Both elements are not observed by market participants who have then to infer the motivation behind recent policy decisions. 12 2.1.1. Maximum Likelihood Estimation of the Interest Rate Process. The system (9) -(10) can be estimated by Maximum Likelihood of the associated Kalman Filter. The procedure is standard and summarized in appendix A.3.19 Our data set consists of monthly observations of the 3 months eurorates for Canada, France, Germany, Italy, Japan and the U.K. evaluated against the 3 months eurodollar.20 The sample period is 1974:1 to 1995:12.21 The results are presented in Table 3, for various autoregressive orders. A quick glance through the table indicates that (a) there is a strong persistent component, already largely documented in the literature, and (b) there is no sign of transitory component, as measured by the noise to signal ratio ´. Innovations to the persistent component of interest rate di®erentials disappear extremely slowly. The long run autocorrelation ranges from 0:85 for France against the U.S., to 0:99 for Italy against the U.S. The short run autocorrelation is higher than one for the U.K., Germany and Japan against the U.S., indicating further deviations from equilibrium after the initial shock. The table also reports the results of two Phillips-Perron Zt tests of a unit root in interest rate di®erentials. The results indicate that we cannot reject the hypothesis of a unit root at conventional levels of signi¯cance for Canada, Germany Italy and Japan. Thus, the interest rate di®erential for those countries against the U.S. does not appear co-integrated. For all countries and all speci¯cations, the noise to signal ratio is not statistically signi¯cant.22 In only two cases (Italy-US AR1 and Canada-US AR4) is the constraint on the noise to signal ratio non-binding. It is interesting to note that in both cases the associated long run autocorrelation is also much higher than for other speci¯cations.23 2.1.2. Survey Data. To measure market forecasts, we use consensus data from the Finan- cial Times Currency Forecaster on Eurorates forecasts at 3, 6 and 12 months. The data is available monthly from August 1986 to October 1995. Contributors include multinational companies as well as forecasting services from major investment banks, i.e., the most active players on the ¯xed in- come and foreign exchange markets.24 The monthly publication collects interest rates and their forecasts and reports a \market average" weighting individual forecasts according to their relative importance. This data set is unique in its coverage and consistency. We have not found any other source of interest rate forecasts prior to 1986 covering all G-7 countries. While survey data on 13 monthly money market rates are available for the US, and have been used in previous studies, we were unable to ¯nd similar survey forecasts for foreign countries.25 As an alternative to market fore- casts, one could use forward rates implicit in the term structure. This would be incorrect however, if the term structure contains a signi¯cant time varying component. In the working paper version of this paper, we replicate Froot (1989)'s results and demonstrate that this is indeed the case. Thus, we conclude that it is better to use direct consensus forecasts to measure market expectations. One should still be cautious when using survey data. First, there is probably substantial hetero- geneity in forecasts.26 Nonetheless, \market expectations" constructed from individual heteroge- nous forecasts may possess better statistical properties than individual forecasts if the idiosyncratic components \wash out" in the aggregation process. This is not guaranteed. A recent theoretical literature has emphasized that there may be systematic biases in individual forecasts: forecasters who care about their reputation, may have incentives to use forecasts in order to manipulate their clients's belief regarding their ability. Such reputational e®ects are likely to be stronger for pro- fessional forecasters than disinterested parties. The direction of the bias, however, is unclear and depends on the information as well as the payo® structure. Scharfstein and Stein (1990) develop a model where managers have an incentive to mimic the behavior of previous managers, while in Zwiebel (1995)'s model average managers have an incentive to herd while \extreme" types - either good or bad- have incentives to scatter.27 The empirical importance and direction of such reputational biases remains an open question.28 We assume that agents use a linear forecasting formula, as summarized by the Kalman ¯lter equations associated with the state-space representation in equations (9)-(10). We will estimate the parameters of the ¯lter implicitly used by market participants, which we call the \market ¯lter", as opposed to the parameters of the true data generating process. For a given market ¯lter we ³ ´ ³n op ´ can construct the associated forecast at horizon ¿ : x¿t ~µ ; where ~µ = ~i ¸ ~ 2 denotes the ; ´~; ¾ i=1 parameters of the market ¯lter. The forecasts constructed in such a way use only information up ³ ´ ^¿t = x¿t ~µ + À¿t where to time t. We assume that the observed forecasts are reported with error: x the measurement error is i.i.d. and independent of the true forecast, and estimate ~µ by Maximum Likelihood. The results are presented in Table 4 separately for each forecast horizon (3, 6 and 12 months, as well as for all forecast horizon pooled. 14 P ~ Our methodology allows us to distinguish between the conditional persistence ( j ¸j in terms ¾ 2º of our representation), and the relative importance of transitory and persistent shocks (~ ´= ¾ 2 ). As can be seen in Tables 3 and 4, conditional persistence is higher for the market ¯lter than for the data generating process. While the long run autocorrelation is close to 0:9 in the data, the market estimates a much higher long run autocorrelation sometimes very close to 1:29 This extra persistence is compensated by a large estimate for the relative variance ´~. The results in Table 4 indicate that for almost all countries and speci¯cations, the perceived noise-to-signal ratio ´~ is large -and often signi¯cant.30 These results stand in sharp contrast with our ¯nding that in the actual data the transitory component is zero. It implies that market forecasts systematically under-react to current interest rate changes. We now summarize the results of this section. Stylized Fact For the interest rate di®erentials between the US and the other G7 countries, during the period 1986-1995: 1. Persistent shocks are more frequent in sample than perceived by market participants. In other words, the noise-to-signal ratio ´ (i.e., the ratio of the variance of transitory shocks to the variance of permanents shocks) of the data generating process is signi¯cantly smaller than that implied by the linear ¯lter that best replicates interest rate forecasts. This indicates a systematic initial under-reaction to interest rate innovations. 2. Conditional persistence, as measured by the long run auto correlation of the persistent component, is higher for the market ¯lter than for the data generating process. This indicates an over-reaction to interest rate innovations at longer horizons. 3. An Affine Model of Exchange Rate Determination This section presents an a±ne model of exchange rate determination under complete markets, similar to Backus et al. (1998), in which there is imperfect information. Our objective is to connect the forward premium forecast under-reaction to the exchange rate anomalies. The economy consists of two countries, each with its own currency. In each country, nominal 15 assets are traded. The absence of pure arbitrage opportunities implies the existence of a positive pricing kernel for claims in domestic currency, Rt;t+1 ; such that: (11) 1 = Etm fRt;t+1 Rt+1 g where Rt+1 is the rate of return -in domestic currency- on any traded asset in the economy between t and t + 1. If the economy admits a representative agent, the pricing kernel is the nominal intertemporal marginal rate of substitution and (11) corresponds to the ¯rst-order condition of the agent's program. Equation (11) allows for market expectations Etm f:g to di®er from their rational counterpart, denoted Et f:g : Similarly, there exists a nominal pricing kernel for claims expressed in ¤ foreign currency, Rt;t+1 ; that satis¯es: n o 1 = Etm R¤t;t+1 Rt+1 ¤ Using the pricing kernels, we can derive the one-period continuously compounded nominal risk free rates as: n o (12) rt = ¡ log Etm fRt;t+1 g ; rt¤ = ¡ log Etm R¤t;t+1 To derive the rate of currency depreciation from the model, denote by Et the domestic price of foreign currency (so that an increase in Et corresponds to a depreciation of the domestic currency). Under complete markets, and with identical preferences across countries, the pricing kernel is unique (see Du±e (1996)). Since we can construct a pricing kernel for nominal claims in the ¤ Et Et+1 ¤ 31 domestic currency as Rt;t+1 Et+1 ; unicity implies that Rt;t+1 Et = Rt;t+1 in all states of nature. Expected depreciation is then: (13) Etm et+1 ¡ et = Etm log R¤t;t+1 ¡ Etm log Rt;t+1 Substituting for the domestic and foreign nominal interest rates, we obtain: Etm et+1 ¡ et = rt ¡ rt¤ + ³ t ³ n o´ where ³ t = Etm log Rt;t+1 ¤ ¡ log Etm Rt;t+1 ¤ ¡ (Etm log Rt;t+1 ¡ log Etm fRt;t+1 g) The time-varying risk premium ³ t re°ects the di®erence between conditional means of the pricing kernels. We now follow the literature on a±ne-yield models and assume that the pricing kernels follow log-normal processes: 16 ¹ 2 ¾2 ' '2 ¾ 2 (14) ln Rt;t+1 = ¡ ln R ¡ ¡ ± z¹t ¡ ¡ zt ¡ ' ¹ ¹²t+1 ¡ '²t+1 2 2 ¹ ¤2 ¾2 ' '¤2 ¾ ¤2 (15) ln R¤t;t+1 = ¡ ln R ¡ ¡ ± ¤ z¹t ¡ ¡ zt¤ ¡ '¹ ¤ ¹²t+1 ¡ '¤ ²¤t+1 2 2 ¡ ¢ where the elements of ²t+1 = ¹²t+1 ; ²t+1 ; ²¤t+1 0 are independent and normally distributed with mean 0 and variance ¾2 . The positive parameters ±; ± ¤ ; '; '¤ ; ' ¹ ¤ represent the loadings on the ¹ and ' various shocks while R is a positive scalar. z¹t and ¹²t+1 represent, respectively, the predictable ¡ ¢ and unpredictable components of a shock common to both countries, while (zt ; ²t+1 ) and zt¤ ; ²¤t+1 represent the predictable and unpredictable components of country speci¯c shocks. We also assume zt ; zt ; zt¤ )0 obeys the following AR process: that the state zt = (¹ (16) zt+1 = ¸ zt + ²t+1 The speci¯cation in (14)-(16) encompasses a broad class of processes while maintaining ana- lytical tractability. Together, zt and ²t+1 span the predictable and unpredictable components of domestic and foreign bonds and currency price movements. The ''s determine the correlation between each state variable's innovation and the pricing kernel. Under (14)-(15), the equilibrium interest rate and depreciation rates satisfy: (17) rt = ln R + H0 zt ¹2 ¡ ' ' ¹ ¤2 + '2 ¡ '¤2 2 Etm fet+1 g ¡ et = (± ¡ ± ¤ ) z¹t + zt ¡ zt¤ + ¾ 2 (18) = rt ¡ rt¤ + ³ 0 1 B ± 1 0 C where H0 = @ A and r0t = (rt ; rt¤ ) : ¤ ± 0 1 In each country, the domestic interest rate is a linear combination of the world and country speci¯c factors. The expected depreciation rate depends upon the world factor only insofar as it a®ects domestic and foreign interest rates di®erently. When ± = ± ¤ ; the common factor does not a®ect expected depreciation because it shifts domestic and foreign interest rates by the same amounts. We make this assumption in what follows.32 The risk premium ³ is constant and depends upon the variance of the innovations and the loading factors. Since this risk premium is not time-varying, it is irrelevant for our analysis. For 17 simplicity, we set it to 0 by assuming ' ¹ ¤ = ' = '¤ ; implying that uncovered interest parity ¹ =' holds exactly, under market expectations. By ruling out risk premia altogether, this assumption greatly simpli¯es our analysis.33 The previous assumptions imply that rt follows an AR(1) process: (19) rt+1 = (1 ¡ ¸) ln R + ¸ rt + H0 ²t+1 This model has the simplicity of a±ne models of the term structure of interest rates. This is not a coincidence: we have assumed a process for the pricing kernel that is identical to Vasicek's original pricing kernel. 3.1. Market Expectations. As discussed in the previous section, the literature on the rational expectation hypothesis of the term structure of interest rates documents systematic deviations between forward rates and expected future short rates. We take this under-reaction as the starting point of our analysis of market expectations and use a convenient state-space representation to characterize the perceived process for the interest rate process, that is consistent with our empirical speci¯cation. Assume that when making forecasts agent do not use the correct pricing kernels in (14) and (15). Instead, they perceive that the pricing kernels satisfy: (20) ~ t;t+1 = ln Rt;t+1 ¡ vt ln R (21) ~ t;t+1 ln R ¤ ¤ = ln Rt;t+1 ¡ vt¤ where vt and vt¤ are independent normally distributed shocks with mean 0 and variance ¾2v : The perceived pricing kernels di®er from the correct ones by a purely transitory shock. It is important to observe that interest and exchange rates are still determined by (12) and (13). This implies that agents perceive the following relationship between short rates and fundamentals: (22) rt = ln R + H0 zt + vt Economically, (22) indicates that agents believe that interest rates contain a persistent component, H0 zt ; and a transitory one, vt ; while in fact interest rates contain only a persistent one {they follow 18 (17). In all other respects, expectations are rational. In particular, rational expectations obtain as the special case where ¾ 2v = 0: Equation (22) together with (16) form a state space representation for the interest rate process perceived by agents. As we shall see, this representation captures very naturally the stylized fact that interest rate forecasts under-react to innovations in interest rates. The question of interest to us is the extent to which this misperception a®ects exchange rate determination in equilibrium. 3.2. The Learning Problem. Agents form optimal forecasts of future interest rates given their beliefs summarized in (16)-(22). This is a standard normal-linear ¯ltering problem. The solution is given by the Kalman ¯lter (see the appendix for derivations). According to (22), agents form forecasts of future interest rates according to: Etm rt+1 = ln R + H0 Etm zt+1 n o De¯ne Pt+1 = Etm (zt+1 ¡ Etm zt+1 )2 ; the conditional variance of the market belief. The following lemma is a direct consequence of the properties of the Kalman ¯lter. ³ ´ Lemma 1. Assume that beliefs about z1 are initially distributed as N Etm z1j0 ; P1 where Etm z1j0 and P1 are an appropriate vector and matrix respectively. Then: 1. Beliefs evolve according to: ³ ´¡1 ¡ ¢ Etm zt+1 = ¸Et¡1 m zt + ¸Pt H H0 Pt H + ¾2v I rt ¡ log R ¡ H0 Et¡1 m zt 2. The conditional variance Pt evolves according to: · ³ ´¡1 ¸ 2 0 Pt+1 = ¸ Pt ¡ Pt H H Pt H + ¾2v I H0 Pt + ¾2 I in particular, it does not depend upon the actual realizations of the interest rates rt : 3. In the limit as t ! 1; the conditional variance converges to a steady state value P; solution of: · ³ ´¡1 ¸ P = ¸2 P ¡ PH H0 PH + ¾2v I H0 P + ¾ 2 I and the beliefs evolve according to: ³ ´¡1 ¡ ¢ (23) Etm zt+1 = ¸Et¡1 m zt + ¸PH H0 PH + ¾2v I rt ¡ log R ¡ H0 Et¡1 m zt 19 In what follows, we assume that the process has been going on for long enough so that Pt+1 has converged. Two special cases are of interest. First, when expectations are rational, ¾2v = 0; (23) collapses to: Etm rt+1 = ¸rt , as expected. Second, when there is no common shock, ± = 0; interest rates depend only upon their country speci¯c factor and are independent from one another. In this case (23) implies: (24) Etm rt+1 = ¸ (1 ¡ k) Et¡1 m rt + ¸k rt + (1 ¡ ¸) ln R where k, the gain of the ¯lter, measures how much weight is given to new observations, relative to past expectations. Equation (24) makes clear that expected future short rates under-react to changes in the short rate when k < 1: In steady state, the gain of the ¯lter is given by: k = 1+¢¡´ (1¡¸2 ) h ³ ´ i2 ¾2v 2 = ´ 1 ¡ ¸2 + 1 + 1+¢+´ (1+¸ )2 · 1; where ´ = ¾ 2 is the perceived noise-to-signal ratio and ¢ 2 4 ´ ¸ . It follows that the gain depends only on the perceived relative variances of the noise and signal components (´) and the degree of persistence (¸). The gain is zero and no learning occurs when the noise is in¯nite while learning is immediate when there is no noise {i.e., expectations are rational: The gain decreases monotonically with the noise-to-signal ratio and increases with persistence. Intuitively, with a higher ¸, today's interest rates contain more information about the persistent component of interest rates and the current realization of the interest rate gets more weight. Given ¸; there is a one-to-one mapping between the agent's misperception -as measured by ´¡ and the weight given to past beliefs. We can thus indi®erently analyze the properties of the system in terms of (¸; ´) or in terms of (¸; k) : In the general case, ± 6= 0, foreign interest rate observations convey information about z¹t ; the common factor. If we pre-multiply (23) by H0 ; we obtain a formula similar to (24): (25) Etm rt+1 = ¸ (I ¡ K) Et¡1 m rt + ¸K rt + (1 ¡ ¸) ln R ¡ ¢¡1 where K = H0 PH H0 PH + ¾2v I is the matrix representation of the gain of the ¯lter. The formula for the gain indicates that, in general, forecasts of future short rates depend on realizations of both the domestic and foreign short rates. However, a generalized version of under-reaction obtains as the diagonal elements of K are strictly smaller than 1 as long as ¾2v di®ers from 0: 20 3.3. Equilibrium Exchange Rate. The equilibrium exchange rate is obtained by solving forward the uncovered interest parity condition (18): TX ¡1 ³ ´ et = Etm rt+j ¤ ¡ rt+j + Etm et+T j=0 If we de¯ne the equilibrium long-run value of the exchange rate, e¹t ; as limT !1 Etm et+T ; and sub- stitute the market beliefs by (25), we obtain: 1 0 0 m 1 (26) et = e¹t ¡ l0 rt ¡ l H Et zt+1 = e¹t ¡ xt ¡ E m xt+1 1¡¸ 1¡¸ t where l0 = (1; ¡1) ; so that xt = l0 rt = rt ¡ rt¤ is the forward premium. The exchange rate depends only upon the current forward premium xt ; its future expected value Etm xt+1 and on e¹t : We do not focus in this paper on the determinants of e¹t . However, we assume that there is no misperception regarding its expectations: Etm e¹t+1 = Et e¹t+1 = e¹t : Hence, long run misperceptions do not contribute to predictable excess returns. This assumption implies that there are no predictable excess returns over long horizons. Indeed the data presented in Table 1 indicates fewer predictable excess returns at longer horizons. Under rational expectations the exchange rate follows: xt (27) ert = e¹t ¡ 1¡¸ Substracting (27) from (26), we can express the equilibrium exchange rate as the rational expecta- tion exchange rate plus a term that re°ects misperception of the interest rate process: 1 (28) et = ert + (Et xt+1 ¡ Etm xt+1 ) 1¡¸ 3.4. Foreign Exchange Market Anomalies. In this subsection we demonstrate that our simple model is rich enough to rationalize both Eichenbaum and Evans' delayed overshooting puzzle as well as Fama's forward premium puzzle, for some con¯gurations of the parameters. 3.4.1. Predictable excess returns. Recall that predictable excess returns on the domestic currency are de¯ned as: » t = (Etm et+1 ¡ Et et+1 ) ¡ ³ t 21 where ³ t is the risk premium. Since there is no risk premium in our set-up, predictable excess returns originate exclusively from forecast errors. Using (25) and (26): µ ¶ 0 ¸K (29) »t = l I+ (Et rt+1 ¡ Etm rt+1 ) 1¡¸ Predictable excess returns depend linearly upon the misperception in short term interest rates forecasts. The relationship between interest rate forecasts and predictable excess returns is complex since the matrix K is not diagonal in general. To gain some intuition we return to our two special cases. First, when expectations are rational, » t = 0, as expected. Second, in the absence of common factors (± = 0) ; (29) simpli¯es to: µ ¶ ¸k (30) »t = 1 + (Et xt+1 ¡ Etm xt+1 ) 1¡¸ When the expected forward premium is lower than predicted according to the true model (Et xt+1 ¡ Etm xt+1 > 0) ; there are positive excess returns on the domestic currency. The reason is simple: if future forward premia are under-estimated, the currency is arti¯cially depreciated (see (28)) and will subsequently appreciate. Using (24), we can write predictable excess returns in a recursive form in this special case: µ ¶ ¸k (31) » t = ¸ (1 ¡ k) » t¡1 + 1 + ¸ (1 ¡ k) l0 H0 ²t 1¡¸ According to (31) persistence in predictable excess returns increases with the degree of mispercep- tion, measured by 1 ¡ k: 3.4.2. Fama regression and the forward discount puzzle. Here we show how the under-reaction of forward premium forecasts is linked to the forward discount puzzle. Recall that the Fama coe±cient ¯ converges to: cov (et+1 ¡ et ; xt ) p lim ¯ = var (xt ) We prove in the appendix the following result. Lemma 2. The coe±cient of the regression of realized depreciation rates on the forward premium converges in plim to ³ ´ µ ³ ´¡1 ¶ ¸K 2 2 ¸l0 I+ 1¡¸ (I ¡ K) I ¡ I ¡ ¸ (I ¡ K) ¸ K var (rt ) l (32) p lim ¯ = 1 ¡ l0 var (rt ) l 22 It is immediate to check that ¯ = 1 when expectations are rational (since K = I): In the case where there are no common shocks (± = 0), (32) simpli¯es to ¸ (1 ¡ ¸ (1 ¡ k)) (1 ¡ k) (1 + ¸) (33) p lim ¯ = 1 ¡ 1 ¡ ¸2 (1¡k) Since k < 1 and ¸ < 1; the Fama coe±cient is always smaller than 1 and can even be negative. In the general case the Fama coe±cient (32) is a complex function of the parameters of the model and may be signi¯cantly di®erent from 1. Figure 3 reports plots the contour of ¯ as a function of ¸ and k: We see from the graph that ¯ falls as the shocks become more persistent (¸ increases) : The dependence on k is more complex. A low ¯ requires a low k: However, when k = 0; which corresponds to an environment where agents believe all shocks are purely transitory, ¯ = 1 ¡ ¸ and remains strictly positive. Indeed, the minimum of ¯ is attained for small but strictly positive values of k. Figure 3 also reports the point estimates for ¸ and k for each country pair obtained indirectly from our empirical estimates of (~ ´ ~ obtained in section 2.34 and ¸) We can see that for all currencies against the dollar, the process for interest rate forecasts is consistent with very low Fama coe±cients (below 0.2 and even negative for Japan and Germany). We emphasize again that we have not used any exchange rate data in this calculation. Figure 4 reports the coe±cient from a Fama regression at horizon n (¯ n ): Two results emerge. First, the Fama coe±cient can be very low -indeed even negative, at short horizons where the e®ect of misperception is strongest. At longer horizons, however, the Fama coe±cient converges to one. Second, ¯ n can vary non-monotonically with the horizon, as the learning and interest rate e®ects interact in complex ways. It is easy to understand why ¯ must be smaller than 1. By de¯nition, expected depreciation is the di®erence between the forward premium and predictable excess returns: Et et+1 ¡ et = xt ¡ » t : m x so that: But from (30), we know that » t is a function of xt and Et¡1 t µ ¶ ¸k ¡ m ¢ (34) Et et+1 ¡ et = xt ¡ ¸ 1 + (1 ¡ k) xt ¡ Et¡1 xt 1¡¸ where we have used (30) as well as (25). Consider now an increase in the forward premium xt at m x does not increase as much as x and the time t: Since interest rate forecasts under-react, Et¡1 t t 23 contribution of the predictable excess return is negative. Hence the expected depreciation adjusts by less than the forward premium, implying a coe±cient ¯ smaller than 1. An example might help here. Suppose there is an increase in domestic interest rates, holding foreign interest rates constant. Since agents underestimate future domestic interest rates, the currency will initially appreciate less than under rational expectations (see equation (26)). Thus, the rationally expected depreciation is smaller than that implied by the forward discount, and there are positive predictable excess returns. This is true whenever ¸ > 0 and k < 1:35 Yet, we also know from Figure 3 that ¯ can be negative, which is a much stronger result. How can this be? To understand what is going on, return to equation (34). When ¸ increases, the ³ ´ ¸k coe±cient on the forward premium forecast error, ¸ 1 + 1¡¸ (1 ¡ k) ; can exceed the coe±cient on the forward premium (1) ; as predictable excess returns become more volatile. To illustrate what is going on, consider again an increase in domestic interest rates holding foreign interest rates constant . For ¸ large enough; the initial mispricing of the exchange rate is so large that it requires the exchange rate to appreciate further in the future. As agents update their beliefs about the domestic interest rate, they realize the change in interest rates is more persistent than initially anticipated. Any upward revision on the persistent component of interest rates has a large e®ect on exchange rates since agents expect high domestic interest rates to persist in the future. Arbitrage implies that the domestic currency appreciates. This scenario is more extreme: a high domestic interest rate co-exists with an appreciating currency. In other words, the forward premium and rationally expected depreciation move in opposite directions. Thus, ¯ is negative. Our previous m x : But past expected forward premia are correlated with discussion abstracted from the term Et¡1 t the current forward premium. Indeed, one obtains (33) exactly when the correlation is properly taken into account. This term dampens the movements in predictable excess returns and makes it more di±cult for ¯ to turn negative (compare the term in xt in (34) and the Fama coe±cient). 3.4.3. Delayed Overshooting. The delayed overshooting path is characterized by Eichen- baum and Evans (1995) as the impulse response of the exchange rate to an unanticipated monetary shock. In the context of our model, we compute the path that the exchange rate would follow if an innovation ² were to take place at time t, followed by no other shocks. This shock may be an 24 innovation to the common factor z¹ or to a combination of the country speci¯c factors z and z ¤ : For simplicity, assume that we start from steady state with rt¡1 = ln R: According to (19), short rates follow: rt+j = ln R + ¸j H0 ² Under rational expectation, the exchange rate obeys (27) and follows: ¸j et+j = e¹t+j ¡ (² ¡ ²¤ ) 1¡¸ The exchange rate overshoots and converges back to its equilibrium value at the same speed as the interest rate rate. When forward premium forecasts under-react, the exchange rate follows instead: ¸j l0 h i (35) et+j = e¹t+j ¡ I ¡ ¸ (I ¡ K)j+1 H0 ² 1¡¸ The term H0 ² determines how each innovation in°uences the domestic and foreign interest rates, respectively. Recall that common monetary shocks z¹ do not in°uence the rate of depreciation expected by agents when ± = ± ¤ . One can check that (35) simpli¯es to: ¸j h ³ ´i (36) et+j = e¹t+j ¡ 1 ¡ ¸ ¼j+1 j+1 11 ¡ ¼ 21 (² ¡ ²¤ ) 1¡¸ where ¼j+1 il is the (i; l) element of (I ¡ K)j+1 : Equation (36) makes clear that et+j is also una®ected by common shocks. In the special case where there are no common shocks, the formula simpli¯es to (7). Delayed overshooting occurs at horizon ¿ when jet+¿ ¡ Etm e¹t+¿ j > jet+¿ ¡1 ¡ Etm e¹t+¿ ¡1 j : Using (36), and the properties of the learning process, we have the following lemma: Lemma 3. A necessary and su±cient condition for delayed overshooting after ¿ periods is: 1¡¸ (1 ¡ ¸ (¼11 ¡ ¼21 )) (¼¿11 ¡ ¼¿21 ) > ¸ where ¼ij and ¼¿ij are de¯ned respectively the (i; j) element of the matrices I ¡ K and (I ¡ K)¿ : This lemma establishes that delayed overshooting depends upon the parameters driving the learning and interest rate processes as summarized by K and ¸: The discussion of Section 1 applies: delayed overshooting results from the interaction between an interest rate and a learning e®ect. We 25 obtain (8) as a special case when ± = ± ¤ = 0: In this special case the delayed overshooting region is reported in Figure 2. Figure 2 also reports the implicit delayed overshooting horizon for each country vis a vis the US.36 The results indicate that, except for the French Franc, all currencies exhibit delayed over- shooting between 3 and 8 months. While the point estimates are smaller than the 24-36 months delayed overshooting typically reported by Eichenbaum and Evans (1995), they nonetheless indicate that interest rate forecasts are consistent with delayed overshooting for some currency pairs. 4. Conclusion In this paper we establish a link between currency market and bond market anomalies. We show that the data strongly supports the existence of under-reaction of bond prices to changes in short rates. We then develop a nominal exchange rate determination model in which there is incomplete information about the nature of interest rate shocks (transitory vs. persistent). We demonstrate that for typical empirical estimates of the misperception, the equilibrium exchange rate in the model exhibits both delayed overshooting and the forward premium puzzle in its most extreme form {i.e., a negative Fama coe±cient. We see this attempt to investigate whether anomalies in di®erent asset markets are consistent with each other as a novel contribution that might be of interest in areas other than international ¯nance. The dynamics of the exchange rate are determined by two e®ects which act in opposite direc- tions: an interest rate e®ect and a learning e®ect. The ¯rst e®ect is standard. As in other models, it derives from the mean reverting nature of interest rates di®erentials. The learning e®ect is novel, and derives from the fact that agents constantly learn whether interest rate shocks are transitory or persistent. The predictable excess returns and hump-shaped dynamics, implied by the two cur- rency market anomalies, can result if and only if agents misperceive interest rate shocks to be more transitory than what they actually are. The key to our results is the existence of a misperception about second moments of the interest rate process, not about ¯rst moments. Furthermore, currency market anomalies in our model economy only exist in the short run. At long horizons predictable excess returns vanish, and the 26 Fama coe±cient converges to one. This is in line with the empirical evidence. This interpretation, which is new to our knowledge, has important implications. First, it pro- vides a clear analytical characterization of the factors in°uencing exchange rate responses to mon- etary shocks. Countries with rapidly converging interest rates, due to either fast moving prices or a large interest elasticity of money demand, will experience smaller predictable excess returns. Countries with either a very small or a very large variance of transitory shocks will also converge without delayed overshooting. In the former case because learning occurs fast, in the latter case because learning does not have a signi¯cant e®ect on the demand for assets. This paper also serves the useful purpose of uncovering a deeper rationale for a variety of asset market pathologies. In particular, the misperceptions we identify raise the interesting prospect of an integrated understanding of currency and bond markets. Of course, this paper also raises some intriguing questions: why do agents fail to revise their erroneous beliefs about second moments? Can these misperceptions be arbitraged away or taken advantage of by savvier investors? Ultimately, of course, we will need to reconcile observed behavior with models of optimal behavior. While we may not be there yet, this paper indicates a promising avenue of research. 27 References Backus, David, Silverio Foresi, and Chris Telmer, \A±ne Models of Currency Pricing: Accounting for the Forward Premium Anomaly," July 1998. mimeo New York University. Barberis, Nicholas, Andrei Shleifer, and Robert Vishny, \A Model of Investor Sentiment," Journal of Financial Economics, September 1998, 49 (3), 307{43. Batten, Dallas, Michael Blackwell, In-Su Kim, Simon Nocera, and Yuzuru Ozeki, \The Conduct of Monetary Policy in the Major Industrial Countries," International Monetary Fund, Washington DC 1990. Occasional Paper 70. Bernard, Victor, \Stock Price Reactions to Earnings Announcements," in Richard Thaler, ed., Advances in Behavioral Finance, Russell Sage Foundation, NY 1992. Bollerslev, Tim, Ray Chou, and Kenneth Kroner, \ARCH Modelling in Finance: A Review of the Theory and Empirical Evidence," Journal of Econometrics, 1992, 52, 5{59. Campbell, John and Robert Shiller, \Yield Spreads and Interest Rate Movements: a Bird's Eye View," Review of Economic Studies, May 1991, 58 (3), 495{514. Chan, Louis, Narasimhan Jegadeesh, and Josef Lakonishok, \Evaluating the Performance of Value versus Glamour Stocks: the Impact of selection Bias," Journal of Financial Economics, Jul 1995, 38 (3), 269{296. Clarida, Richard and Jordi Gali, \Sources of Real Exchange Rate Fluctuations: How Important Are Nominal Shocks?," 1994. NBER Working Paper 4658. Cutler, David, James Poterba, and Lawrence Summers, \Speculative Dynamics," Review of Economic Studies, May 1991, 58 (3), 529{546. DeBondt, Werner and Richard Thaler, \Does the Stock Market Overreact?," Journal of Finance, 1985, 40, 793{808. Dornbusch, Rudiger, \Expectations and Exchange Rate Dynamics," Journal of Political Econ- omy, 1976, 84, 1161{1176. 28 Du±e, Darrell, Dynamic Asset Pricing Theory, Princeton University Press, 1996. Ehrbeck, Tilman and Robert Waldmann, \Why Are Professional Forecasters Biased? Agency Versus Behavioral Explanations," Quarterly Journal of Economics, Feb 1996, 111 (1), 21{40. Eichenbaum, Martin and Charles Evans, \Some Empirical Evidence on the E®ects of Mon- etary Policy Shocks on Exchange Rates," Quarterly Journal of Economics, Nov 1995, 110, 975{1009. Fama, Eugene, \Forward and Spot Exchange Rates," Journal of Monetary Economics, 1984, 14 (4), 319{338. and Kenneth French, \The Cross-Section of Expected Stock Returns," Journal of Finance, Jun 1992, 47 (2), 427{465. Frankel, Je®rey and Andy Rose, \A Survey of Empirical Research on Nominal Exchange Rates," 1994. NBER Working Paper 4865. and Kenneth Froot, \Forward Discount Bias: Is it an Exchange Risk Premium?," Quar- terly Journal of Economics, 1989, 104, 139{161. Friedman, Benjamin, \Interest Rate Expectations versus Forward Rates: Evidence from an Expectations Survey," Journal of Finance, 1980, 34, 965{973. Froot, Kenneth, \New Hope for the Expectations Hypothesis of the Term Structure of Interest Rates," Journal of Finance, 1989, 44 (2), 283{304. Grilli, Vittorio and Nouriel Roubini, \Liquidity and Exchange Rates: Puzzling Evidence From the G-7 Countries," 1994. mimeo Yale University. Hamilton, James, Time Series Analysis, Princeton University Press, 1994. Hodrick, Robert, \The Empirical Evidence on the E±ciency of Forward and Futures Foreign Exchange Markets," in \Fundamentals of Pure and Applied Economics" Harwood Academic Publishers 1988. 29 Hong, Harrison and Jeremy Stein, \A Uni¯ed Theory of Underreaction, Momentum Trading and Overreaction in Asset Markets," Journal of Finance, December 1999, 54 (6), 2143{84. Jegadeesh, Narasimhan and Sheridan Titman, \Returns to Buying Winners and Selling Losers: Implications for Stock Market E±ciency," Journal of Finance, 1993, 48, 65{91. Kaminski, Graciela, \Is There a Peso Problem? Evidence form the Dollar/Pound Exchange Rate, 1976-1987," American Economic Review, 1993, 83, 450{472. Lewis, Karen, \Can Learning A®ect Exchange Rate Behavior? The Case of the Dollar in the Early 1980's," Journal of Monetary Economics, 1989, 23, 79{100. , \Changing Beliefs and Systematic Rational Forecast Errors with Evidence from Foreign Exchange," American Economic Review, 1989, 79, 621{636. , \Puzzles in International Finance," 1994. NBER Working Paper 4951. Muth, John, \Optimal Properties of Exponentially Weighted Forecasts," Journal of the American Statistical Association, 1960, 55, 299{306. Prendergast, Canice and Lars Stole, \Impetuous Youngsters and Jaded Oldtimers: Acquiring a Reputation for Learning," 1996. forthcoming @JPE. Sa'a-Requejo, Jesus, \The Dynamics and the Term Structure of Risk Premia in Foreign Exchange Markets," May 1994. mimeo INSEAD. Scharfstein, David and Jeremy Stein, \Herd Behavior and Investment," American Economic Review, 1990, 80, 464{79. Zarnowitz, Victor and Phillip Braun, \Twenty Two Years of the NBER-ASA Quarterly Eco- nomic Outlook Surveys: Aspects and Comparisons of Forecasting Performance," 1992. NBER Working Paper 3965. Zwiebel, Je®rey, \Corporate Conservatism and Relative Compensation," Journal of Political Economy, 1995, 103, 1{25. 30 Table 1: Test of Forward Premium Puzzle Estimates the following: et+n ¡ et = ® + ¯ n xnt + ´t+n . et+n ¡ et is the n-period realized depreciation rate vis a vis the US dollar. xnt is the n- period continuously compounded euro-rate and xnt = rtn ¡ rtn;us is the forward premium. The number of observations varies for each country pair between 100 and 299. Coe±cients estimated with OLS. Newey-West (1987) Robust standard errors in parentheses. The last column reports the p-value for the test ¯ n = 1. Source: DRI. Exchange Rate from IFS; Interest Rates: Eurorates from FACS. Country Sample ¯n SE p Sample ¯n SE p horizon: 3 months horizon: 6 months All 74:1-99:7 -0.72 0.22 0.00 74:1-99:4 -0.64 0.18 0.00 Canada 79:8-99:7 -0.87 0.40 0.00 79:8-99:4 -0.75 0.42 0.00 France 74:1-98:9 -0.16 0.85 0.17 74:1-98:6 -0.11 0.86 0.19 Germany 74:1-98:9 -0.50 0.73 0.04 74:1-98:6 -0.47 0.70 0.04 Italy 80:10-98:9 1.41 1.27 0.75 80:10-98:6 1.59 1.45 0.59 Japan 78:1-99:7 -2.98 0.82 0.00 78:1-99:4 -3.06 0.72 0.00 U.K. 74:1-99:7 -1.25 0.95 0.02 74:1-99:4 -1.00 0.88 0.02 horizon: 12 months All 74:1-97:10 0.93 0.13 0.58 Canada 78:1-98:10 0.70 0.26 0.25 France 74:1-97:12 0.49 0.71 0.47 Germany 80:10-97:12 1.39 0.43 0.36 Italy 79:8-97:12 0.64 1.20 0.77 Japan 86:8-95:10 2.58 0.93 1.71 U.K. 74:1-98:10 0.82 0.37 0.63 31 Table 2: Decomposition of the Fama Coefficient Decomposes ¯ n according to: ¯ n = 1 ¡ bre ¡ brp . cov(un ;xn t ) var(³ n m n t )+cov(Et et+n ¡et ;³ t ) with bre = ¡ vart+n , brp = (xnt ) var(xnt ) Country bre brp ¯n bre brp ¯n horizon: 3 months horizon: 6 months All 0.81 0.19 -0.05 0.93 0.08 0.15 Canada 3.03 -0.20 -1.83 2.48 -0.59 -0.89 France 1.51 -1.87 1.35 1.03 -1.55 1.51 Germany 2.30 -1.55 0.25 2.20 -1.58 0.38 Italy -0.50 -1.70 3.20 -0.72 -1.22 2.94 Japan 4.91 -0.33 -3.58 4.93 -0.70 -3.23 U.K. 2.82 0.13 -1.95 2.54 -0.52 -1.02 horizon: 12 months All -0.98 0.92 1.06 Canada -1.83 1.47 1.36 France 0.03 -0.15 1.11 Germany -0.78 0.98 0.80 Italy -2.10 2.41 0.7 Japan -1.72 1.26 1.45 U.K. -0.99 0.90 1.08 32 Table 3: Maximum Likelihood Estimation of the State-Space Representation of 3 Months Eurorates Estimates the following state-space model on monthly data: xt = ¹+ zt + º t ; ¸ (L) zt = ²t . xt is the 3 months eurorate di®erential rt ¡rtus . We report estimates of the autoregressive ¾ 2º coe±cients (¸) and the noise to signal ratio ´ = ¾ 2² . Sample period: 1974:1 to 1995:12. The number of observations varies for each country pair. Heteroskedasticity consistent standard errors (SE) reported below the estimate in parenthesis. Coe±cients estimated by Iterated Maximum Likelihood of the Kalman Filter. A value of 0 for ´ indicates that the associated constraint (´ ¸ 0) is binding. and the corresponding AR process, estimated directly, maximizes the likelihood. Coe±cients signi¯cant at the 5% level are reported in bold. Z½ and Zt : Phillips-Perron statistics of the unit root hypothesis H0 (® = 0; ¯ = 1) in the regression: xt = ® + ¯ xt¡1 + ut with 12 lags in the Bartlett window. The associated p-values are reported in parenthesis. Source: IFS line 60ldd and 60ea. Estimated Coe±cients P ¸1 ¸2 ¸3 ¸4 ´ ¸ Model SE SE SE SE SE SE Log-Lik Panel A: Canada-U.S. Obs 100 I 0.90 0 0.90 -0.229 Z½ -8.31 (0.04) (0.04) (0.15) II 0.93 -0.03 0 0.90 -0.226 Zt -2.04 (0.10) (0.10) (0.04) (0.17) III 0.92 0.06 -0.09 0 0.89 -0.199 (0.10) (0.14) (0.10) (0.05) IV 1.37 -0.31 -0.38 0.29 0.42 0.97 -0.181 (0.25) (0.26) (0.07) (0.09) (0.51) (0.02) Panel B: France-U.S. Obs 213 I 0.85 0 0.85 -1.267 Z½ -30.79 (0.03) (0.03) (0.0) II 0.86 -0.01 0 0.85 -1.265 Zt -4.07 continued next page 33 Table 3: Maximum Likelihood Estimation, 3 Months Eurorates continued from previous page Estimated Coe±cients P ¸1 ¸2 ¸3 ¸4 ´ ¸ Model SE SE SE SE SE SE Log-Lik (0.07) (0.07) (0.04) (0.0) III 0.87 -0.06 0.05 0 0.86 -1.260 (0.07) (0.09) (0.07) (0.04) IV 0.87 -0.06 0.06 -0.01 0 0.86 -1.260 (0.07) (0.09) (0.09) (0.47) (0.04) Panel C: Germany-U.S. Obs 213 I 0.96 0 0.96 -0.717 Z½ -5.25 (0.02) (0.02) (0.5) II 1.23 -0.28 0 0.95 -0.686 Zt -1.67 (0.07) (0.07) (0.02) (0.45) III 1.31 -0.63 0.29 0 0.97 -0.653 (0.07) (0.10) (0.07) (0.02) IV 1.31 -0.63 0.28 0.01 0 0.97 -0.653 (0.07) (0.11) (0.11) (0.07) (0.02) Panel D: Italy-U.S. Obs 94 I 0.99 0.25 0.99 -0.610 Z½ -6.58 (0.02) (0.29) (0.02) (0.5) II 0.70 0.25 0 0.95 -0.563 Zt -1.83 (0.10) (0.10) (0.03) (0.30) III 0.74 0.36 -0.16 0 0.94 -0.559 (0.10) (0.12) (0.10) (0.03) continued next page 34 Table 3: Maximum Likelihood Estimation, 3 Months Eurorates continued from previous page Estimated Coe±cients P ¸1 ¸2 ¸3 ¸4 ´ ¸ Model SE SE SE SE SE SE Log-Lik IV 0.69 0.46 0.06 -0.28 0 0.93 -0.528 (0.10) (0.12) (0.12) (0.10) (0.03) Panel E: Japan-U.S. Obs 205 I 0.94 0 0.94 -0.764 Z½ -11.95 (0.02) (0.02) (0.08) II 1.15 -0.22 0 0.93 -0.725 Zt -2.51 (0.07) (0.07) (0.02) (0.12) III 1.14 -0.16 -0.05 0 0.93 -0.723 (0.07) (0.11) (0.07) (0.02) IV 1.14 -0.16 -0.07 0.02 0 0.93 -0.722 (0.07) (0.11) (0.11) (0.07) (0.02) Panel F: U.K.-U.S. Obs 213 I 0.93 0 0.93 -0.776 Z½ -14.70 (0.02) (0.02) (0.05) II 1.10 -0.18 0 0.92 -0.764 Zt -2.80 (0.07) (0.07) (0.02) (0.07) III 1.13 -0.42 0.22 0 0.93 -0.743 (0.07) (0.10) (0.07) (0.02) IV 1.14 -0.43 0.26 -0.04 0 0.93 -0.741 (0.07) (0.10) (0.10) (0.07) (0.02) 35 Table 4: Pooled Maximum Likelihood Estimation of the Market Filter for 3 Months Eurorates ³ ´ Estimates the following state-space model on monthly data: x^¿t = x¿t ~ µ + À ¿t ³ ´ n o where x¿t ~ µ = Et xt+¿ j~µ is the forecast of the interest rate di®erential ¿ periods hence, according to the Kalman Filter with parameter ~ µ: xt = ¹ + zt + º t ; ¸ (L) zt = ²t . xt = rt ¡ rtus is the forward premium. Estimated Coefficients Estimated Coefficients P P ¸1 ¸2 ´ ¸ ¸1 ¸2 ´ ¸ Model SE SE SE SE Log-Lik Obs SE SE SE SE Log-Lik Obs Panel A: Canada-U.S. Panel B: France-U.S. Horizon: All I 0.972 2.99 0.972 0.309 327 0.997 151.88 0.997 -0.861 291 (0.004) (0.65) (0.004) (0.001) (13.43) (0.001) II 0.275 0.675 0.73 0.950 0.323 0.117 0.879 42.76 0.996 -0.836 (0.07) (0.07) (0.26) (0.008) (0.16) (0.16) (8.70) (0.001) Horizon: 3 Months I 0.996 2.40 0.996 0.579 109 1.027 349.75 1.027 -0.723 97 (0.009) (0.788) (0.009) (0.004) (80.11) (0.004) II 0.155 0.828 0.23 0.983 0.684 -0.111 1.150 41.68 1.039 -0.639 (0.08) (0.08) (0.14) (0.012) (0.04) (0.04) (11.38) (0.01) Horizon: 6 Months I 0.974 3.86 0.974 0.448 109 1.009 186.85 1.009 -0.702 97 (0.008) (1.335) (0.008) (0.004) (34.05) (0.004) II 0.385 0.573 1.31 0.958 0.454 0.477 0.536 79.67 1.013 -0.702 (0.034) (0.031) (0.586) (0.013) (0.09) (0.07) (0.34) (0.03) Horizon: 12 Months I 0.969 4.45 0.969 0.072 109 1.005 190.39 1.005 -0.181 97 (0.007) (2.049) (0.007) (0.004) (41.80) (0.004) II 0.246 0.700 1.327 0.946 0.074 0.304 0.704 65.73 1.008 -0.179 (0.047) (0.043) (0.774) (0.012) (0.09) (0.09) (16.31) (0.005) continued next page 36 Table 4: Pooled Maximum Likelihood Estimation of Market Filter continued from previous page Estimated Coefficients Estimated Coefficients P P ¸1 ¸2 ´ ¸ ¸1 ¸2 ´ ¸ Model SE SE SE SE Log-Lik Obs SE SE SE SE Log-Lik Obs Panel C: Germany-U.S. Panel D: Italy-U.S.37 Horizon: All I 0.979 12.26 0.979 0.265 327 1.060 NA 1.060 -1.569 291 (0.002) (9.85) (0.002) (0.001) (NA) (0.001) II 1.76 -0.766 14.89 0.994 0.575 0.008 1.138 NA 1.146 -1.537 (0.01) (0.01) (2.84) (0.003) (0.01) (0.01) (NA) (0.002) Horizon: 3 Months I 0.993 3.853 0.993 0.927 109 1.048 NA 1.048 -1.342 97 (0.003) (1.92) (0.003) (0.002) (NA) (0.002) II 1.766 -0.769 14.64 0.997 1.262 -0.002 1.160 NA 1.157 -1.081 (0.02) (0.02) (4.67) (0.001) (0.008) (0.001) (NA) (0.003) Horizon: 6 Months I 0.978 28.97 0.978 0.039 109 1.06 NA 1.06 -1.469 97 (0.003) (89.44) (0.003) (0.001) (NA) (0.001) II 1.804 -0.810 21.28 0.994 0.481 0.084 1.058 NA 1.142 -1.373 (0.017) (0.016) (6.95) (0.001) (6.16) (6.70) (NA) (0.54) Horizon: 12 Months I 0.978 22.03 0.978 0.206 109 1.059 NA 1.059 -1.380 97 (0.002) (52.91) (0.002) (0.001) (NA) (0.001) II 1.736 -0.742 14.69 0.994 0.468 0.119 1.015 NA 1.134 -1.267 (0.01) (0.02) (4.63) (0.001) (0.04) (0.04) (NA) (0.004) continued next page 37 Table 4: Pooled Maximum Likelihood Estimation of Market Filter continued from previous page Estimated Coefficients Estimated Coefficients P P ¸1 ¸2 ´ ¸ ¸1 ¸2 ´ ¸ Model SE SE SE SE Log-Lik Obs SE SE SE SE Log-Lik Obs Panel E: Japan-U.S. Panel F: U.K.-U.S. Horizon: All I 0.981 3.44 0.981 0.530 327 0.977 1.67 0.977 0.195 327 (0.002) (1.02) (0.002) (0.002) (0.32) (0.002) II 1.690 -0.697 8.58 0.993 0.701 1.766 -0.774 27.34 0.992 0.414 (0.02) (0.02) (1.57) (0.007) (0.02) (0.02) (6.39) (0.001) Horizon: 3 Months I 0.996 0.36 0.996 1.192 109 0.997 1.25 0.997 0.727 109 (0.004) (0.12) (0.004) (0.005) (0.33) (0.005) II 1.784 -0.787 14.96 0.997 1.661 1.690 -0.693 13.52 0.997 0.852 (0.03) (0.03) (5.28) (0.001) (0.07) (0.07) (6.86) (0.002) Horizon: 6 Months I 0.988 5.31 0.988 0.524 109 0.986 1.69 0.986 0.093 109 (0.004) (2.97) (0.004) (0.004) (0.52) (0.004) II 1.781 -0.788 14.22 0.993 1.022 1.806 -0.812 38.72 0.994 0.380 (0.02) (0.024) (3.80) (0.001) (0.03) (0.03) (14.74) (0.001) Horizon: 12 Months I 0.978 0.51 0.978 0.229 109 0.972 1.74 0.972 0.079 109 (0.002) (0.38) (0.002) (0.003) (0.78) (0.003) II 1.527 -0.537 5.28 0.990 0.264 1.776 -0.784 37.43 0.992 0.313 (0.06) (0.06) (2.52) (0.001) (0.02) (0.02) (15.09) (0.001) 38 20 18 16 14 12 10 8 6 4 2 0 20 40 60 80 100 time Figure 1: Delayed overshooting ({) and rational expectation (- -)response to monetary innovation. ¸ = 0:95; k = 0:2: Figure 2: Contour plot of Delayed Overshooting Region as a function of the parameters (¸; k). 39 Figure 3: Contour plot of Asymptotic Limit of Fama coe±cient as a function of k and ¸: 1 0.8 0.6 0.4 0.2 time 10 20 30 40 50 0 -0.2 -0.4 -0.6 -0.8 -1 Figure 4: Figure 4: Fama coe±cient ¯ n as a function of the regression horizon. ¸ = 0:95 and k = 0:2: 40 Figure 5: Euro 3 Months Forward Premium against the US dollar. 41 A. Appendices A.1. Data sources. Variable De¯nition Database Source Series Eurorates US 3 months US deposit DRI IFS L60LD&D@C111 London o®er rate UK average of the 3 months DRI Reuters UKD03A/B eurocurrency deposit bid and ask rate France DRI Reuters FRD03A/B Germany DRI Reuters WGD03AY/BY Italy DRI Reuters ITD03A/B Canada DRI Reuters CAD03A/B Japan DRI Reuters JAD03A/B. Consensus Forecasts 3,6 and 12 months consensus Financial Times forecasts on 3 months Eurorates Currency Forecaster Exchange Rates spot rates DRI IFS A.2. Currency Risk Premium versus Expectational Errors. The standard Fama regression takes the following form: ¢ent = et+n ¡ et = ® + ¯ n xnt + ´ nt+n where xnt = rtn ¡rt¤n is the current n¡period forward discount, equal to the n¡period interest rate di®erential. Under the null that (a) expectations are rational and (b) the currency risk premium is uncorrelated with xnt ; ¯ n = 1: The estimates of this regression are shown in Table 1. Exchange rates are spot rates from DRI-IFS and interest rates are 3, 6 and 12 months eurorates. The regression uses monthly observations and standard errors correct for the overlap between the sampling frequency and the horizon. The results are typical of this literature. The Fama coe±cient is often signi¯cantly negative and almost always smaller than one.38 At longer horizons (12 months), the coe±cients are larger, often becoming not signi¯cantly di®erent from 1. For the sub-period 84:5 to 97:5, we also have exchange rate survey forecasts, compiled by the Financial Times Currency Forecaster. Frankel and Froot (1989) show that the Fama coe±cient can be decomposed into and expectational component bre and a risk premium component brp such that: ¯ = 1 ¡ bre ¡ brp ³ ´ ³ ´ ¡ ¢ cov ukt+k ; xkt var ³ kt + cov Etm et+k ¡ et ; ³ kt bre = ¡ ¡ ¢ ; brp = ¡ ¢ var xkt var xkt 42 where ³ kt is the risk premium for horizon k and ukt+k is the market expectational error et+k ¡ Etm et+k : bre is equal to 0 when this error term is uncorrelated with the forward premium, i.e. expectations are rational. The second term is zero if the risk premium is constant. Table 2 reports the results of this decomposition. The results validate Frankel and Froot (1989)'s earlier analysis: at short horizons (3 to 6 months), for all countries except Italy, the expectational term bre dominates the risk premium term brp by an order of magnitude. If anything, the risk premium term would tend to increase the Fama coe±cient. At 12 months, however, the results are reversed with bre contributing positively to ¯ n . A.3. Kalman Filter Estimation. In this part we derive the Kalman ¯lter equations by following Hamilton (1994). We postulate the following process for the forward premium: (A.1) yt = H0 zt + º t (A.2) zt = Fzt¡1 + ²t ; where yt is a qx1 vector, zt is a px1 state vector, H and F are qxp and pxp matrices, respectively. Equation (A.1) is the measurement equation and (A.2) is the space equation with E fº t g = R and E f²t g = Q where Q and R are respectively a pxp and qxq matrices. De¯ne the informations set It = fyt¡i ; i ¸ 0g ; n¡ ¢¡ ¢0 o ^ ^ t+1jt = E zt+1 ¡ ^ zt+1jt = E fzt+1 jIt g ; and P zt+1jt zt+1 ¡ ^ zt+1jt jIt : The ¯ltering equations are: ³ ´¡1 ¡ ¢ (A.3) ^ zt+1jt = F^ ^ tjt¡1 H H0 P zt¡1jt + FP ^ tjt¡1 H + R yt ¡ H0^ztjt¡1 · ³ ´¡1 ¸ (A.4) ^ t+1jt P ^ ^ 0^ = F Ptjt¡1 ¡ Ptjt¡1 H H Ptjt¡1 H + R 0^ H Ptjt¡1 F0 + Q The smoother equations are: ¡ ¢ ^ tjt F0 P^ ¡1 ^ ^ ztjT = ^ ztjt + P t+1jt zt+1jT ¡ ^ zt+1jt ³ ´³ ´³ ´0 ^ tjT P ^ tjt + P = P ^ ¡1 ^ tjt F0 P ^ t+1jT ¡ P P ^ t+1jt P ^ tjt F0 P ^ ¡1 ; where t+1jt t+1jt ³ ´¡1 ^ tjt P ^ tjt¡1 ¡ P = P ^ tjt¡1 H H0 P ^ tjt¡1 H + R ^ tjt¡1 H0 P Suppose that the current estimate for the state variable at time t is ^ ztjt : According to (A.1)-(A.2), the market forecast for y ¿-periods hence is: ³ ´ (A.5) yt¿ ~µ ´ E fyt+¿ jIt g = H0^zt+¿ jt = H0 F¿ ^ ztjt ³ ´ Suppose now that we observe an imprecise measure of this forecast: y^t¿ = yt¿ ~ µ +À¿t ; where the measurement error À ¿t is uncorrelated with the true forecast. We can estimate ~ µ by minimizing: 43 T ³ ³ ´ XX ³ ´´0 ³ ³ ´´ (A.6) ~ S µ = y^t¿ ¡ yt¿ ~ µ y^t¿ ¡ yt¿ ~ µ ¿ t=1 Applying (A.1)-(A.2) to the interest rate di®erential with up to three lags in the persistent component, we estimate (9)-(10) as well as the market process, using (A.6).Conversely, applying the Kalman ¯lter to the vector of interest rates rt = (rt ; rt¤ )0 ; and the state vector zt = (¹ zt ; zt ; zt¤ ) ; we obtain lemma 1. A.4. Empirical Results on Interest Rate Di®erentials. In this part we develop the esti- mation procedure for our state-space representation. Assume that the State Space representation (9)-(10) ^p1j0 is normally distributed, x holds. Under the normality assumption, and assuming additionally that x ^pt+1 ^pt+1jt and variance P is normally distributed conditionally on It , with mean x ^ t+1jt . We can then write the conditional likelihood of xt+1 as: 0³ ´2 1 p ¯ ¯ xt+1 ¡ ¹ ¡ H0 x ^ ¯ ^ 2 ¯ B t+1jt t+1jt C log fxt+1 jIt (xt+1 jIt ) / log ¯H0 Pt+1jt H + ¾ º I¯ + @ A 0 ^ 2 H Pt+1jt H + ¾º I PT ¡1 We maximize the sample log likelihood t=0 log fxt+1 jIt (xt+1 jIt ) with respect to the vector of parameters ¡ p ¢0 ^p0 µ = f¸i gi=1 ; ´; ¾ 2² ; ¹ .39 To initiate the estimation procedure, we need an estimate of the space variable x and its conditional mean square error. Maximum likelihood estimation over the vector µ is then performed. 0 Once an estimate µ^ is found, we run the smoother in order to revise the initial state vector. That is, the smoother gives us the initial value of the persistent component, conditional on the entire sample information n 0 o and the ¯lter parameters, x^p1 0 = E x ^p0 jIT ; ^ µ , and its mean square error. In general, this revised estimate does not correspond to the initial one. We can then iterate the maximum likelihood estimation with this 1 new initial state variable until convergence to µ^ : Iterating this procedure will give ultimately a parameter vector consistent with the initial state vector.40 A.5. Proofs. Under rational expectations, interest rates and exchange rates follow: (A.7) rt = ln R + H0 zt ¹2 ¡ ' ' ¹ ¤2 + '2 ¡ '¤2 2 Etm fet+1 g ¡ et = (± ¡ ± ¤ ) z¹t + zt ¡ zt¤ + ¾ 2 (A.8) = rt ¡ rt¤ + ³ Proof. Using equations (14) and (15) in the pricing equation (12), we obtain: µ ¶ ¹ 2 ¾2 ' '2 ¾2 rt = ¡ log Et exp ¡ ln R ¡ ¡ ± z¹t ¡ ¡ zt ¡ ' ¹ ¹²t+1 ¡ '²t+1 2 2 44 ¹ 2¾2 ' '2 ¾ 2 = ln R + + ± z¹t + + zt ¡ log Et exp (¡¹' ¹²t+1 ¡ '²t+1 ) 2 2 2 2 2 2 2 2 2 2 ' ¹ ¾ ' ¾ ' ¾ '¹ ¾ = ln R + + ± z¹t + + zt ¡ ¡ 2 2 2 2 = ln R + ± z¹t + zt A similar equation obtains for the foreign interest rate. De¯ning rt as the vector (rt ; rt¤ )0 , we obtain equation (A.7). From equation (13), we obtain: Etm et+1 ¡ et ¤ = Etm log Rt;t+1 ¡ Etm log Rt;t+1 µ ¶ m ¹ ¤2 ¾2 ' ¤ '¤2 ¾2 ¤ ¤ ¤ ¤ = Et ¡ ln R ¡ ¡ ± z¹t ¡ ¡ zt ¡ ' ¹ ¹²t+1 ¡ ' ²t+1 ¡ 2 2 µ ¶ m ¹ 2 ¾2 ' '2 ¾ 2 Et ¡ ln R ¡ ¡ ± z¹t ¡ ¡ zt ¡ ' ¹ ¹²t+1 ¡ '²t+1 2 2 ¾2 ¡ 2 ¢ = (± ¡ ± ¤ ) z¹t + zt ¡ zt¤ + ¹ ¡' ' ¹ ¤2 + '2 ¡ '¤2 2 ¡ ¢ Lemma 4. Assume that beliefs about z1 are initially distributed as N Etm z1j0 ; P1 where Etm z1j0 and P1 are an appropriate vector and matrix respectively. Then: 1. Beliefs evolve according to: ¡ ¢¡1 ¡ ¢ Etm zt+1 = ¸Et¡1 m zt + ¸Pt H H0 Pt H + ¾2v I rt ¡ log R ¡ H0 Et¡1 m zt 2. The conditional variance Pt evolves according to: h ¡ ¢¡1 0 i Pt+1 = ¸2 Pt ¡ Pt H H0 Pt H + ¾ 2v I H Pt + ¾ 2 I in particular, it does not depend upon the actual realizations of the interest rates rt : 3. In the limit as t ! 1; the conditional variance converges to a steady state value P; solution of: h ¡ ¢¡1 0 i P = ¸2 P ¡ PH H0 PH + ¾ 2v I H P + ¾2I and the beliefs evolve according to: ¡ ¢¡1 ¡ ¢ Etm zt+1 = ¸Et¡1 m zt + ¸PH H0 PH + ¾2v I rt ¡ log R ¡ H0 Et¡1 m zt 45 Proof. This is a direct application of the Kalman ¯lter presented in appendix A.3. The Kalman ¯lter consists of the following state and measurement equations: rt = ¡ ln R + H0 zt + vt zt = ¸zt¡1 + ²t so we can apply equations (A.3) and (A.4) with yt = rt ; Etm zt+1 = ^zt+1jt ; F = ¸:I; R = ¾ 2v :I; and Q = ¾2 : n¡ ¢ ¡ ¢0 o m m m I: De¯ne Pt = Et¡1 zt ¡ Et¡1 zt zt ¡ Et¡1 zt : The ¯ltering equation is then ¡ ¢¡1 ¡ ¢ Etm zt+1 m = ¸Et¡1 zt + ¸Pt H H0 Pt H + ¾2v :I rt ¡ log R ¡ H0 Et¡1 m zt h ¡ ¢¡1 0 i Pt+1 = ¸2 Pt ¡ Pt H H0 Pt H + ¾2v :I H Pt + ¾2 :I The rest of the proof is immediate. Pre-multiplying by H0 ; and adding log R on both sides, we obtain: ¡ ¢ ¡ ¢¡1 ¡ ¢ log R + H0 Etm zt+1 = ¸ log R + H0 Et¡1 m zt + ¸H0 PH H0 PH + ¾2v :I rt ¡ log R ¡ H0 Et¡1 m zt + (1 ¡ ¸) log R ¡ ¢ Etm rt+1 m = ¸Et¡1 m rt + ¸K rt ¡ Et¡1 rt + (1 ¡ ¸) log R Etm rt+1 m = ¸ (I ¡ K) Et¡1 rt + ¸Krt + (1 ¡ ¸) log R ¡ ¢¡1 where K = H0 PH H0 PH + ¾2v :I is the gain of the ¯lter. In the case without common shocks (± = ± ¤ = 0) ; this formula simpli¯es to: ~2 ¾ K = k:I = :I2 ~ + ¾ 2v ¾ ¾2 ~ 2 :I = ¡ P = ¾ 2 ¢ :I 1 ¡ ¸ (1 ¡ k) ~ 2 and solving, we obtain: Substituting for ¾ ¡ ¢ 1 + ¢ ¡ ´ 1 ¡ ¸2 k = ¡ ¢ ·1 1 + ¢ + ´ 1 + ¸2 ¡ ¢ 1 + ¢ + ´ 1 + ¸2 2 ~2 ¾ = ¡ ¢¾ 1 + ¢ + ´ 1 ¡ ¸2 ¡ ¡ ¢ ¢2 where ¢ = ´ 1 ¡ ¸2 + 1 + 4´¸2 and ´ = ¾2v =¾2 is the noise-to-signal ratio. Under misperception, we m postulate that the exchange rate is a function of the current interest rates rt and the current forecasts Et¡1 zt ; as well as a long term equilibrium value e¹t that satis¯es e¹t = limT !1 Etm et+T : Iterating on the uncovered interest rate parity condition, we obtain: 1 X 1 X et = ¡ Etm xt+j + lim Etm et+T = ¡ Etm xt+j + e¹t T !1 j=0 j=0 46 To compute the exchange rate, we solve for Etm xt+j at all j ¸ 2 : Etm xt+j = l0 Etm rt+j © m ª = l0 Etm Et+j¡1 rt+j © ª = l0 Etm ¸ (I ¡ K) Et+j¡2 m rt+j¡1 + ¸Krt+j¡1 + (1 ¡ ¸) log R = l0 ¸ (I ¡ K) Etm rt+j¡1 + l0 ¸K Etm rt+j¡1 = ¸ l0 Etm rt+j¡1 = ¸j¡1 Etm xt+1 ; where the second line uses the law of iterated expectations and the third line uses l0 (1 ¡ ¸) log R = 0: Substituting back into the expression for the exchange rate, we obtain: 1 X et = ¡xt ¡ Etm xt+1 ¡ Etm xt+1 ¸j¡1 + e¹t j=2 1 = ¡xt ¡ E m xt+1 + e¹t 1¡¸ t 1 0 m (A.9) = ¡l0 rt ¡ l E rt+1 + e¹t 1¡¸ t Under rational expectation, Etm rt+1 = ¸rt and the exchange rate follows: 1 0 ert = ¡ l rt + e¹t ; 1¡¸ where we are assuming implicitly that limT !1 Etm et+T = limT !1 Et et+T = e¹t : In other words, we are assuming that there are no long run prediction errors. To calculate predictable excess returns, recall from equation (2) that » t = Etm et+1 ¡ Et et+1 : Substituting (A.9), we obtain: µ ¶ m 0 1 0 m » t = Et ¡l rt+1 ¡ l E rt+2 + e¹t+1 1 ¡ ¸ t+1 µ ¶ 0 1 0 m ¡Et ¡l rt+1 ¡ l E rt+2 + e¹t+1 1 ¡ ¸ t+1 µ µ ¶ ¶ m 0 ¸ ¸ 0 m = ¡Et l I + K rt+1 + l (I ¡ K) Et rt+1 ¡ e¹t+1 1¡¸ 1¡¸ µ µ ¶ ¶ 0 ¸ ¸ 0 m +Et l I + K rt+1 + l (I ¡ K) Et rt+1 ¡ e¹t+1 1¡¸ 1¡¸ µ ¶ ¸ = l0 I + K (Et rt+1 ¡ Etm rt+1 ) ; 1¡¸ m where the second line uses the de¯nition of Et+1 rt+2 and the assumption that Et e¹t+1 = Etm e¹t+1 : Lemma 5. The coe±cient from the regression of realized depreciation rates on the forward premium con- verges in plim to ³ ´ ³ ¡ ¢¡1 2 ´ ¸l0 I + ¸K 1¡¸ (I ¡ K) I ¡ I ¡ ¸2 (I ¡ K) ¸ K var (rt ) l (A.10) p lim ¯ = 1 ¡ l0 var (rt ) l 47 Proof. According to its de¯nition, cov (et+1 ¡ et ; xt ) p lim ¯ = var (xt ) cov (Et et+1 ¡ et + ut+1 ; xt ) = var (xt ) cov (Et et+1 ¡ et ; xt ) = var (xt ) cov (Et et+1 ¡ Etm et+1 + Etm et+1 ¡ et ; xt ) = var (xt ) cov (Et et+1 ¡ Etm et+1 + xt ; xt ) = var (xt ) cov (» t ; xt ) = 1¡ var (xt ) where ut+1 = et+1 ¡Et et+1 is uncorrelated with xt and the ¯fth line uses the uncovered interest parity. Using the de¯nition of » t ; we see that µ ¶ 0 ¸ cov (» t ; xt ) = l I + K cov (Et rt+1 ¡ Etm rt+1 ; rt ) l 1¡¸ µ ¶ 0 ¸ ¡¡ m ¢ ¢ = ¸l I + K (I ¡ K) cov rt ¡ Et¡1 rt ; rt l 1¡¸ µ ¶ 0 ¸ ¡ ¡ m ¢¢ = ¸l I + K (I ¡ K) var (rt ) ¡ cov Et¡1 rt ; rt l 1¡¸ and ¡ m ¢ ¡ m ¢ cov Et¡1 rt ; rt = cov ¸ (I ¡ K) Et¡2 rt¡1 + ¸Krt¡1 + (1 ¡ ¸) log R; ¸rt¡1 + H0 ²t ¡ m ¢ = cov ¸ (I ¡ K) Et¡2 rt¡1 + ¸Krt¡1 ; ¸rt¡1 ¡ m ¢ = ¸2 (I ¡ K) cov Et¡2 rt¡1 ; rt¡1 + ¸2 K var (rt¡1 ) so that ¡ m ¢ ¡ ¢¡1 2 cov Et¡1 rt ; rt = I ¡ ¸2 (I ¡ K) ¸ K var (rt ) and µ ¶ ³ ¸ ¡ ¢¡1 2 ´ 0 cov (» t ; xt ) = ¸l I + K (I ¡ K) I ¡ I ¡ ¸2 (I ¡ K) ¸ K var (rt ) l 1¡¸ Thus, the Fama coe±cient equals: ³ ´ ³ ¡ ¢¡1 2 ´ ¸l0 I + ¸ 1¡¸ K (I ¡ K) I ¡ I ¡ ¸2 (I ¡ K) ¸ K var (rt ) l p lim ¯ = 1 ¡ l0 var (rt ) l in the case where there are no common shocks, the formula simpli¯es to ¸ (1 ¡ ¸ (1 ¡ k)) (1 ¡ k) (1 + ¸ ) p lim ¯ = 1 ¡ 1 ¡ ¸2 (1 ¡ k) 48 In the case without common shocks, one can also derive the Fama coe±cient from a regression at horizon n: To do so, we must ¯rst derive the yield on a n period bond. De¯ne rtn the continuously compounded yield on a n period zero-coupon bond. rtn satis¯es: 1 rtn = ¡ log Etm Rt;t+n n Since Rt;t+n = Rt;t+1 ::::Rt+n¡1;t+n ; we observe that rtn can be de¯ned recursively as the solution to: © ¡ ¢ª (A.11) ¡ (n + 1) rtn+1 = log Etm Rt;t+1 exp ¡nrt+1 n where rt1 = rt : We conjecture a linear solution of the form: (A.12) rtn = An + Bn rt + Cn Etm rt+1 '2 ¾ 2 with A1 = log R; B1 = 1 and C1 = 0: Using the fact that log Rt;t+1 = ¡ ln R ¡ 2 ¡ zt ¡ '²t+1 = 2 2 ' ¾ ¡rt ¡ 2 ¡ '²t+1 ; and substituting equation (A.12) into the term structure equation (A.11), we obtain: ½ µ ¶¾ '2 ¾2 ¡ ¢ ¡ (n + 1) rtn+1 = log Etm exp ¡rt ¡ m ¡ '²t+1 ¡ n An + Bn rt+1 + Cn Et+1 rt+2 2 '2 ¾ 2 = ¡rt ¡ ¡ nAn ¡ nBn Etm rt+1 ¡ nCn (¸Etm rt+1 + (1 ¡ ¸) log R) 2 + log Etm fexp ('²t+1 ¡ n (Cn ¸k + Bn ) (rt+1 ¡ Etm rt+1 ))g = ¡ (n + 1) (An+1 + Bn+1 rt + Cn+1 Etm rt+1 ) Identifying coe±cients, we obtain: (n + 1) Bn+1 = 1 (n + 1) Cn+1 = ¸ n Cn + nBn '2 ¾2 (n + 1) An = + nAn ¡ nCn (1 ¡ ¸) log R 2 + log Etm fexp ('²t+1 ¡ n (Cn ¸k + Bn ) (rt+1 ¡ Etm rt+1 ))g Solving for Bn and Cn , we obtain 1 1 1 ¡ ¸n¡1 m rtn = An + rt + E rt+1 n n 1¡¸ t The interpretation is straightforward: except for an intercept term that captures a constant risk premium, the long term yield is a simple average of the short term interest rate and future expected short rates: 1 n (rt + Etm rt+1 + ::: + Etm rt+n¡1 ) : Similarly, we can calculate predictable excess returns at horizon n; » nt = Etm et+n ¡ Et et+n : Since we are assuming that there is no misperception about the equilibrium exchange rate 49 e¹t ; we know that limn!1 » nt = 0 by construction. Therefore, the Fama coe±cient must converge to one. At shorter horizons, substituting repeatedly from (1), we obtain: 1 X 1 X » nt = m Et Et+n xt+n+j ¡ Etm xt+n+j j=0 j=0 For n ¸ 1 and j ¸ 1 we have: Etm xt+n+j = ¸n+j¡1 Etm xt+1 m Et Et+n xt+n+j = ¸j¡1 Et Et+n m xt+n+1 ¡ ¢ = ¸j¡1 Et ¸ (1 ¡ k) Et+n¡1 m rt+n + ¸krt+n ¡ ¢ = ¸j¡1 Et ¸ (1 ¡ k) Et+n¡1 m rt+n + ¸krt+n ³ h i ´ = ¸j¡1 Et rt+1 ¸n k 1 + (1 ¡ k) + :: + (1 ¡ k)n¡1 + (¸ (1 ¡ k))n Etm rt+1 = ¸n+j¡1 (Et rt+1 (1 ¡ (1 ¡ k)n ) + (1 ¡ k)n Etm rt+1 ) : Thus, 1 X n n » nt = ¸n¡1 Et xt+1 + (Et rt+1 (1 ¡ (1 ¡ k) ) + (1 ¡ k) Etm rt+1 ) ¸n+j¡1 j=1 1 X ¡Etm xt+1 ¸n+j¡1 j=0 n¡1 ¸ ¸n ¸n¡1 m = Et xt+1 + (1 ¡ k)n (Etm rt+1 ¡ Et rt+1 ) ¡ E xt+1 1¡¸ 1¡¸ 1¡¸ t ¸n¡1 = (1 ¡ ¸ (1 ¡ k)n ) (Et xt+1 ¡ Etm xt+1 ) 1¡¸ The ¯rs term re°ects the interest rate e®ect, while the second term re°ects the learning e®ect. As the horizon increases, the interest rate e®ect brings predictable excess returns back to zero. Consider now a Fama regression at horizon n : et+n ¡ et = ®n + ¯ n xnt + unt+n where xnt = l0 rnt = rtn ¡ rt¤;n is the n-period forward premium. The Fama coe±cient at horizon n converges to: cov (et+n ¡ et ; xnt ) cov (» nt ; xnt ) p lim ¯ n = = 1 ¡ var (xnt ) var (xnt ) Observe that it is not enough to have limn!1 » nt = 0 to conclude that limn!1 p lim ¯ n = 1 because the variance of the forward premium also converges to 0: In fact, we can write: µ ¶ n n ¸n¡1 n m 1 1 1 ¡ ¸n¡1 m cov (» t ; xt ) = (1 ¡ ¸ (1 ¡ k) ) cov ¸xt ¡ Et xt+1 ; xt + E xt+1 1¡¸ n n 1¡¸ t µ ¶ ¸n n m 1 1 1 ¡ ¸n¡1 ¡ m ¢ = (1 ¡ ¸ (1 ¡ k) ) (1 ¡ k) cov xt ¡ Et¡1 xt ; xt + ¸ (1 ¡ k) Et¡1 xt + ¸kxt 1¡¸ n n 1¡¸ 50 Solving for each term, we have: ¡ m ¢ ¸2 k cov xt ; Et¡1 xt = var (xt ) 1 ¡ ¸2 (1 ¡ k) ¡ m ¢ ¡ m ¢ var Et¡1 xt = var ¸ (1 ¡ k) Et¡2 xt¡1 + ¸kxt¡1 µ ¶ ¸2 k2 1 + ¸2 (1 ¡ k) = 2 var (xt ) 1 ¡ ¸2 (1 ¡ k) 1 ¡ ¸2 (1 ¡ k) Lastly µ ¶ 1 1 1 ¡ ¸n¡1 m var (xnt ) = var xt + E xt+1 n n 1¡¸ t Substituting, we obtain the expression for ¯ n plotted in Figure 4. 51 Notes 1 See Hodrick (1988) and Lewis (1994) for surveys. 2 The view that agents constantly engage in learning about the stance and duration of monetary policy seems rather uncontroversial. Suppose, for instance, that monetary authorities control the short term nominal interest rate, as in the US and many other countries. A setup with transitory and persistent interest rate shocks seems appropriate to capture agents' reactions to Federal Open Market Committee (FOMC) meetings (See Batten, Blackwell, Kim, Nocera and Ozeki (1990) for a description of the operating procedures of major central banks). We could think of each meeting as an interest rate innovation. Since investors only have limited information regarding the most current monetary decisions, they must conjecture how persistent the last decision of the FOMC will be. For instance, access to the minutes of a FOMC meeting are only availabe after a six weeks delay, at the following meeting. 3 Interestingly these results open the possibility that exchange rate determination and term structure are consistent with one another. 4 The results of Clarida and Gali (1994), Grilli and Roubini (1994) nuance the original results of Eichen- baum and Evans (1995) and indicate that delayed overshooting may not occur for all country pairs. 5 See appendix A.2 for details. 6 See Lewis (1994) and Frankel and Rose (1994) for surveys. 7 See Appendix A.2 for details. 8 Another class of models with expectational errors is the so-called \Peso problem," whereby if an ex- pected shift in regime does not materialize in sample, expectations will appear systematically biased to the econometrician. Kaminski (1993) shows that Peso problems can account for part of the forward discount premium in a model in which regime switches follow a Markov process. 9 This ¯nance literature also emphasizes overreaction at longer horizons, as evidenced by long term neg- ative correlation in returns (DeBondt and Thaler (1985)). The over-reaction is not relevant for our inter- pretation of the puzzles. However, we note that our results also indicate that conditional persistence, as perceived by market participants, is larger than the data indicate. Thus, at longer horizon, forward premia forecasts overreact to current premium. 52 10 Two well known examples are Hong and Stein (1999) and Barberis, Shleifer and Vishny (1998). The former assumes that there are two classes of agents, `newswatcher' who cannot condition on past prices but observe private signals about the fundamentals, and `momentum' traders only condition on past prices. The latter presents a model of investor sentiment consistent with experimental psychological evidence. In their model, the representative investor alternates optimally between two mental representations of earnings (a mean reverting and a trend one), neither of which corresponds to the true process (a random walk). 11 Equivalently, the second source of °uctuations arises if the long run equilibrium exchange rate does not follow a martingale: Et e¹t+1 6= e¹t : 12 Formula (5) is derived under the assumption that initially agents believe that x1 » N (x1 ; ¾1 ): p 1+(1¡k)¿ ¡ Á 2 13 The boundary of D¿ is given by: ¸ (k; ¿ ) = 2(1¡k)¿ +1 where Á = [1 + (1 ¡ k)¿ ] ¡ 4 (1 ¡ k)¿ +1 . 14 Similar processes have often been used in the learning literature, starting with Muth (1960) in his exploration of the link between rational and adaptive expectations. 15 A more realistic interest rate process would incorporate conditional heteroskedasticity to capture the tendency in ¯nancial data for volatility clustering -i.e., the tendency for large (small) price changes to be followed by other large (small) price changes of unpredictable sign. ARCH models and their various extensions have been successfully applied to several ¯nancial time series (see Bollerslev, Chou and Kroner (1992) for a survey), including interest rates. 16 Indeed, this representation is equivalent to a restricted ARMA process. 17 We do not want to push this interpretation too far. One reason is that the empirical literature on money, output and interest rates tries to separate the exogenous and endogenous components of money shocks. Our univariate representation does not allow for this distinction. 18 In the U.S., investors have access to the minutes of the Federal Open Market Committee meetings with a six weeks delay. The full transcript is only available after 3 years. See Batten et al. (1990) for a description of of the operating procedures of the major Central Banks. 19 See Hamilton (1994, chapter 13) for further details. 20 Our focus on eurorates is driven by the availability of forecast data for all G-7 countries. 21 The 3 months Eurorates come from the IFS tape (lines 60ldd, and 60ea). 53 22 This statement understates our empirical results: in most cases, the constraint that ´ ¸ 0 is binding. In such cases, we estimated directly an AR process. 23 The use of eurorates may be somewhat problematic, as the forward discount premium may re°ect a country speci¯c risk premium. The working paper version of this paper presents the results from a similar estimation on Money market rates. The results are virtually unchanged. 24 The Forecasting services that contribute to the Currency Forecaster's Digest are: Predex, Merril Lynch, Mellon Bank, Harris Trust, Bank of America, Morgan Grenfell, Chase Manhattan, Royal Bank of Canada, Midland Montagu, Generale de Banque, MMS International, Chemical Bank, Union Bank of Switzerland, Multinational Computer Models, Goldman Sachs International, Business International, M. Murenbeeld, and Westpac Bank. The multinational companies that contribute are: General Electric, Du Pont, WR Grace, Allied Signal, Monsanto, Ingersoll-Rand, General-Motors, Data General, Eli Lilly, Aetna, American Express, Johnson and Johnson, Sterling Drug, Firestone, 3M, Union Carbide, Texaco, United Brands, SmithKline Beckman, American National Can, RJ Reynolds, Colgate-Palmolive, Warner-Lambert, Schering-Plough, Quaker Oats, Beatrice Foods, Hercules, Baxter Travenol, and Interpublic Group. 25 Froot (1989) uses quarterly data on the three months T-bill from 1969 to 1986 from the Nagan Bond and Money Market Letter. This dataset has also been used in Friedman (1980). 26 See Zarnowitz and Braun (1992). 27 Prendergast and Stole (1996) present a model where the incentive to scatter or mimic depends on age, with younger forecasters more likely to \exaggerate". 28 Ehrbeck and Waldmann (1996) conclude that models of stretegic bias are rejected by the data. 29 While this indicates that the process might not be perceived as stationary, ¯nite horizon forecasts are still well de¯ned. 30 Our procedure did not converge for the Italian Lira. 31 This relation pins down the rate of depreciation, not the level of the exchange rate. It is consistent with environments in which the law of one price or purchasing power parity holds, as well as with environments in which it does not. 32 The common shock still a®ects the determination of interest rates. 33 Since the z 0 s are conditionally normally distributed, nothing prevents the nominal interest rates rt from 54 being negative in this model, which is a somewhat unpleasant feature. It is possible to ¯x this problem along the lines of Cox Ingersoll and Ross by assuming a time-varying conditional variance of ²t . However, the learning problem described in the next section becomes history dependent and one does not obtain closed-form solutions. We assume instead that we remain in the vicinity of ln R so that this problem is irrelevant. 34 The gain k~ is constructed from the value of ´~ estimated in table 4 for an AR(1) and forecast data at all horizons, and the value of ¸ from the true process (table 3, AR(1)). Since the true ¸ are often smaller than the market ones, we are biasing the results against ¯nding delayed overshooting or the Fama puzzle. We exclude the Italy Lira since the market ¯lter did not converge. 35 If interest rates follow a white noise process (¸ = 0), there are no forecast misperceptions (since the agents are biased in favor of a white noise to start with) and no predictable currency movements. 36 With the exception of the Italian Lira. See footnote 34. 37 The ML estimator of ´ failed to converge for Italy-US, we report estimates using a value of ´ = 160000 38 The exceptions are Italy at all horizons and Japan at 12 months. 39 The estimation takes into account the maturity longer than the sampling frequency. 40 The asymptotic properties are the same whether we iterate or not. 55