Optimal Control of Trading AlgorithmsA General Impulse Control Approach_第1页
Optimal Control of Trading AlgorithmsA General Impulse Control Approach_第2页
Optimal Control of Trading AlgorithmsA General Impulse Control Approach_第3页
Optimal Control of Trading AlgorithmsA General Impulse Control Approach_第4页
Optimal Control of Trading AlgorithmsA General Impulse Control Approach_第5页
已阅读5页,还剩30页未读 继续免费阅读

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

Copyright by SIAM. Unauthorized reproduction of this article is prohibited. SIAM J. FINANCIAL MATH. c 2011 Society for Industrial and Applied MathematicsVol. 2, pp. 404438Optimal Control of Trading Algorithms: A General Impulse Control ApproachBruno Bouchard, Ngoc-Minh Dang, and Charles-Albert LehalleAbstract. We propose a general framework for intraday trading based on the control of trading algorithms.Given a set of generic parameterized algorithms (which have to be specified by the controller ex-ante), our aim is to optimize the dates (i)iat which they are launched, the length (i)iof thetrading period, and the value of the parameters (Ei)ikept during the time interval i,i+ i).This provides the financial agent a decision tool for selecting which algorithm (and for which setof parameters and time length) should be used in the dierent phases of the trading period. Fromthe mathematical point of view, this gives rise to a nonclassical impulse control problem wherenot only the regime Eibut also the period i,i+ i) have to be determined by the controller atthe impulse time i. We adapt the weak dynamic programming principle of Bouchard and TouziSIAM J. Control Optim., 49 (2011), pp. 948962 to our context to provide a characterization ofthe associated value function as a discontinuous viscosity solution of a system of partial dierentialequations with appropriate boundaryconditions, for which we provea comparison principle. Wealsopropose a numerical scheme for the resolution of the above system and show that it is convergent.We finally provide a simple example of application to a problem of optimal stock trading with anonlinear market impact function. This shows how parameters adapt to the market.Key words. optimal impulse control, discontinuous viscosity solutions, intraday tradingAMS subject classifications. 93E20, 49L25, 91B28DOI. 10.1137/0907772931. Introduction. Trading algorithms are nowadays widely spread among financial agents.They are typically used for high frequency intraday trading purposes, e.g., for “statisticalarbitrage” or for the execution of large orders by brokers. In both cases, the use of robots isjustified by the fact that orders have to be executed very quickly, in order to make profit of“good prices,” and, typically for brokers, by the large size of the portfolios to be handled bya limited number of traders.Many eorts have been devoted in recent years to building ecient trading algorithms,taking all possible market features into account, and in particular the so-called market impacteect. Some of the most popular ones have been proposed by academics (see, e.g., 1, 2, 3,6, and 11); a lot of them are developed in research divisions of brokers or investment banksand are somehow kept secret as fundamental building blocks of their everyday business.Received by the editors November 16, 2009; accepted for publication (in revised form) February 15, 2011;published electronically June 7, 2011./journals/sifin/2/77729.htmlCEREMADE, Universite Paris Dauphine and CREST, ENSAE-ParisTech, Paris Cedex 16, France (bouchardceremade.dauphine.fr).CEREMADE, Universite Paris Dauphine and CA Cheuvreux, Paris Cedex 16, France (dangceremade.dauphine.fr).CA Cheuvreux, Paris La Defense 92920, France ().404Downloaded 03/14/15 to 66. Redistribution subject to SIAM license or copyright; see /journals/ojsa.php Copyright by SIAM. Unauthorized reproduction of this article is prohibited. OPTIMAL CONTROL OF TRADING ALGORITHMS 405One could expect that these algorithms correspond to global optimization problems andare run on the whole trading period without interruption. In practice, this is often not thecase, in particular among brokers.In fact the way brokers execute large orders is typically as follows: they split the globalnumber of assets to be bought or sold into small pieces, called slices, and then use robots toexecute these dierent slices one by one. Each time a new slice is launched, the trader tunesthe parameters of the algorithm (and possibly the size of the slice) depending on the evolutionof the markets conditions. He can even decide to use a dierent algorithm from one slice tothe other (which can actually also be viewed as changing the parameters of a single robot, atleast from the mathematical point of view).Thereal life situation is thusthefollowing: given a bunch of trading algorithms, thetraderdecides how to slice the order (i.e., divide the global order in smaller parts), at what timeeach slice is launched, and with which algorithm and values of the parameters. Moreover, healso decides how long he should let the algorithm run. In practice, there exists a minimaltime period during which each algorithm is executed. One reason for this is simply that thetrader cannot practically monitor all the algorithms that are running for dierent purposesat the same time. Another one is that it makes no sense (in practice) to launch an algorithmfor less than, say, one second.The aim of this paper is to provide a decision tool for traders given the above describedpracticalsituation. Namely, weproposearigorousframeworkfortheoptimalcontroloftradingalgorithms: how does one decide optimally the times at which algorithms are launched, whatparameters to use, and for how long? Obviously, one could argue that it would be betterto discuss a global optimization problem, i.e., discuss the optimal control problem associatedwith the global trading agenda. This is not the aim of this paper. Most practitioners havetheir own algorithms and do not want to have the same as the others. Our approach allowsus to adapt to the algorithms each trader wants to use (optimal or not), and help them to usethese algorithms in an optimal way, and in particular to tune the parameters. Exactly like thecomputation of greeks serving as a reference for the hedging policy of an option, which is thenmore or less followed by the trader depending on his own feeling of the markets conditions(and because he can in practice only discretely rebalance his portfolio), we aim at providingon-line values of the parameters that should be optimal in some sense and therefore shouldserve as references for the trader in the way he launches the dierent slices.We are therefore not interested here in the trading algorithms themselves. The aim of thispaper is not to come up with a new algorithm but to provide a rigorous decision tool for theuse of already existing trading algorithms.From the mathematical point of view, it gives rise to a nonclassical impulse control prob-lem. As for standard impulse control problems (see, e.g., the reference book 5), one choosesthe times at which “impulses” are given (times at which an algorithm is launched) and thesize of the impulse (the value of the parameters). The novelty comes from the fact that onealso chooses a time period duringwhich no new impulse can begiven (the period duringwhichthe algorithm runs). To the best of our knowledge, such problems have never been discussedin the mathematical literature on optimal impulse control.In this paper, we provide a rigorous characterization of the value function as a discontinu-ous viscosity solution of a partial dierential equation (PDE), together with suitable boundaryDownloaded 03/14/15 to 66. Redistribution subject to SIAM license or copyright; see /journals/ojsa.php Copyright by SIAM. Unauthorized reproduction of this article is prohibited. 406 B. BOUCHARD, N.-M. DANG, AND C.-A. LEHALLEconditions. To this end, we adapt the approach of Bouchard and Touzi 7, who propose aweak version of the dynamic programming principle. The main advantage of this weak for-mulation is that it does not require any a priori continuity of the value function. It is the firsttime this approach is used in the context of impulse control problems, and this requires somenontrivial modifications of the arguments of 7.Our PDE characterization reads as follows. When the current regime is the passive one,i.e., no trading algorithm is running, the controller can launch one at any moment iwitha given set of parameters Eiand for a period of length i. This gives rise to a standardquasi-variational inequality in the region corresponding to the passive regime. However, oncethe algorithm is launched, no change in the value of the parameters can be made before theend of the period i,i+i). This implies that the value function satisfies a linear parabolicequation on the active region. We also provide a comparison principle for the above equationsand construct a finite dierence numerical scheme, which we prove to be convergent.The rest of the paper is organized as follows. The model is described in section 2.Insection 3, we provide the PDE characterization of the value function and the associatedcomparison principle. The proofs of these results are given in section 4.Thenumericalscheme is studied in section 5. In the last section, we discuss a very simple numerical exampleof application to a particular model of optimal stock acquisition. It shows how the proposedframework naturally allows a real-time adaptive control of the trading algorithm, by switchingoptimally after the end of each slice given the current state of the market.Notation. All over this paper, we shall use the following notation. Given x Rk,forkgiven by the context, we denote by |x| its Euclidean norm and by Br(x) the open ball ofcenter x and radius r0. The scalar product is denoted by ,. Given a set A Rk,we denote by A its boundary. Given d N,wedenotebyMdthe set of d-dimensionalsquare matrices. For M Md, Mis the associated transposed matrix. For a function(t,x,y) R+ Rd Rkmapsto (t,x,y), we denote by D and D2 its gradient and Hessianmatrix with respect to x, whenever they are well defined. The other partial derivatives willbe written by using standard notation.2. Problem formulation. Let (,F,P) be a probability space supporting a d-dimensionalBrownian motion W, d 1. Let F := (Ft)t0denote the right-continuous complete filtrationgenerated by W,andletT0 be a finite time horizon.2.1. Control policies. We firstdescribe how the trading algorithms are controlled; precisedynamics will be imposed in section 2.2. Note that dierent algorithms can be viewed asa single parameterized one. In what follows, we therefore consider that we have only onealgorithm.A control policy of the trading algorithm is described by a nondecreasing sequence ofstopping times (i)i1and a sequence of ,)E-valued random variables (i,Ei)i1.Thestopping times idescribe the times at which an order is given to the algorithm, Eiis thevalue of the parameters with which the algorithm is run, and iis the length of the period(latency period) during which it is run with the value Ei.ThesetE is a compact subset ofRd, which represents the possible values of the parameters, and the quantity0 0 i+i T) ,i 1 .(2.1)The first condition expresses the fact that a new order cannot be given before the end of thetime period associated with the previous order. The second one means that an order shouldbe given only if it ends before the final time horizon T.Remark 2.1. 1. The minimal duration constraint, i with 0, has been justifiedfrom a practical point of view in the introduction. From the mathematical point of view,the problem described in sections 2.2 and 2.3 would not make sense without this conditionif no additional cost related to launching the algorithm with new parameters is introduced.Indeed, for = 0, and without additional costs, the controller could, at the limit, control theparameters continuously, and this would actually certainly be optimal. The controller willthen act, at the limit, as a trader acting continuously on the market.2. Models with = 0 and with an additional cost (paid each time the algorithm islaunched with new parameters) could be discussed by following the lines of this paper. Such acost is actually already embedded in the general dynamics of section 2.2,uptoanadditionalassumption on the function ;seeExample2.10. This would, however, require justifying thiscost and evaluating it in practice. Moreover, certain bounds like the one stated in Remark 2.2would no longer be true, and other conditions would be required to retrieve the estimates ofRemark 2.5. For the sake of simplicity, we therefore stick to the case 0, which correspondsmore, from our point of view, to practical situations.3. In the absence of a cost penalizing frequent changes in the parameters, it may beoptimal to choose i= most of the time (depending on whether is small or not). However,other values of imay also be optimal. Our algorithm provides a way to select the maximaloptimal value, the one for which the trader changes the parameters the least often (which isdesirable in practice).As usual the value of the parameters and the size of the latency period cannot be chosenin some anticipative way; i.e., we impose that(i,Ei)isFi-measurable ,i 1 .(2.2)At time t i,i+i), the value of the parameter of the trading algorithm is denoted by t.For t A(i,i)i1), defined asA(i,i)i1):=R+uniondisplayi1i,i+i),we set t= pi1,wherepi1 RdE can be viewed as a cemetery point; recall that E is compact.It follows that the value of the parameters of the trading algorithm can be written ast= pi11tA(i,i)i1)+summationdisplayi1Ei1ti,i+i),t 0,T ,(2.3)where t= pi1 means that the algorithm is not running at time t.Downloaded 03/14/15 to 66. Redistribution subject to SIAM license or copyright; see /journals/ojsa.php Copyright by SIAM. Unauthorized reproduction of this article is prohibited. 408 B. BOUCHARD, N.-M. DANG, AND C.-A. LEHALLEInthefollowing,wedenotebyS the set of adapted processes that can be written inthe form (2.3) for some sequence of stopping times (i)i1and of ,)E-valued randomvariables (i,Ei)i1satisfying (2.1)and(2.2).For ease of notation, we shall now write(i,i,Ei)i1the sequence associated with Sand define, for all stopping times 1and 2satisfying 1 2P-a.s., the set of indicescorresponding to orders whose execution ends between 1and 2:I1,2:= i 1:10,thealgorithmisrunningwithavalueoftheparameterst.Whent= 0, the algorithm is no longer running, t= pi1, and a new order can be passed.The following picture sums up the dynamics of the control:a54a45Timepi1Ei Ei+1ia45a27ii+ii+1a45a27i+1ta45a27ti+1+i+1Downloaded 03/14/15 to 66. Redistribution subject to SIAM license or copyright; see /journals/ojsa.php Copyright by SIAM. Unauthorized reproduction of this article is prohibited. OPTIMAL CONTROL OF TRADING ALGORITHMS 4092.2. Output of the trading algorithm. Given some initial data (t,x) 0,T Rd,theoutput of the trading algorithm associated with some control policy Sis defined as thestrong solution Xt,xon 0,T of the SDE(2.4)Xt,x(s)=x+1stparenleftBiggintegraldisplaystb(Xt,x(r),r)dr +integraldisplaysta(Xt,x(r),r)dWr+summationdisplayi1(Xt,x(i),Ei,i)1t0 for which, for all x,xprime Rd, e,eprimeE, ,prime ,T,|(x,e,) (xprime,e,)|) K|xxprime|(x,e,)|K(1+|x|)|(x,e,) (x,eprime,prime)|K(1+|x|)(|eeprime|+| prime|)for = b,a, .(2.5)We do not dierentiate here between the components that correspond to real outputs ofthe algorithm (cumulated gains, cumulated volumes executed by the algorithm, etc.) andothers that simply describe the evolution of financial data or market factors (prices of thetraded assets, global traded volumes on the markets, volatilities, etc.).The jumps on the dynamics are introduced to model the changes in the initial conditionson the variables of interest for the trading algorithm when it is launched (e.g., volume to beexecuted between iand i+i); see Example 2.7.Moreover, there is no loss of generality in assuming that X, W,and have the samedimension d. One can always reduce more general situations to this case by playing with thecoecients b, a, and with the choice of E. Time-dependent coecients can similarly beconsidered by putting the first line of a and equal to 0, and the first component of b equalto 1, so that the first component of X actually coincides with the time parameter.We refer the reader to section 2.4 for the description of simple examples of applicationwhich illustrate the flexibility of the above model.We conclude this section with two technical remarks that will be used later.Remark 2.3. Observe that Xdoes not jump when the regime is switched to the passiveregime pi1.Remark 2.4. Note that (2.5), the fact that E is bounded, and Remark 2.2 imply that, forall t 0,T, x Rd,and S,EbracketleftBiggsupst,T|Xt,x(s)|pbracketrightBigg CpK(1+|x|p) ,(2.6)where CpKdepends only on K and p 1.2.3. Gain function. The aim of the controller is to maximize the expected value of thegain functional Smapsto t,x():=g(Xt,x(T)+summationdisplayiIt,Tf(Xt,x(i+i),Ei) ,Downloaded 03/14/15 to 66. Redistribution subject to SIAM license or copyright; see /journals/ojsa.php Copyright by SIAM. Unauthorized reproduction of this article is prohibited. 410 B. BOUCHARD, N.-M. DANG, AND C.-A. LEHALLEwith the usual conventionsummationtext= 0, within the setSt,e:=braceleftbiggbraceleftbig S : s= e for s t,t+)andt+=0bracerightbigif e negationslash= pi1 and 0 , S : t= pi1 otherwise ,where (,e) R+E denotes the initial state of the remaining latency time and value of theparameters.Here, g and f are assumed to be continuous on RdE and to satisfy, for some 0,sup(x,e)RdE|f(x,e)| + |g(x)|1+|x|0 which depends only on K and such that|V(t,x,e)|CK(1+|x|) for all (t,x,e) 0,TRdR+E such that Sat,enegationslash= .Note that

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

评论

0/150

提交评论