IROS2019国际学术会议论文集2013

上传人：我*** IP属地：北京上传时间：2020-06-05 格式：PDF 页数：6 大小：1.46MB 积分：12 举报 版权申诉

免费预览已结束，剩余1页可下载查看

下载本文档

版权说明：本文档由用户提供并上传，收益归属内容提供方，若内容存在侵权，请进行举报或认领

文档简介

Deep Predictive Autonomous Driving Using Multi-Agent Joint Trajectory Prediction and Traffi c Rules Kyunghoon Cho, Timothy Ha, Gunmin Lee, and Songhwai Oh AbstractAutonomous driving is a challenging problem because the autonomous vehicle must understand complex and dynamic environment. This understanding consists of predicting future behavior of nearby vehicles and recognizing predefi ned rules. It is observed that not all rules have equivalent values, and the priority of the rules may change depending on the situation or the drivers driving style. In this work, we jointly reason both a future trajectories of vehicles and degree of satisfaction of each rule in the deep learning framework. Joint reasoning allows modeling interactions between vehicles, and leads to better prediction results. A rule is represented as a signal temporal logic (STL) formula, and a robustness slackness, a margin to the satisfaction of the rule, is predicted for the both autonomous and other vehicle, in addition to future trajectories. Learned robustness slackness decides which rule should be prioritized for the given situation for the autonomous vehicle, and fi lter out non-valid predicted trajectories for surrounding vehicles. The predicted information from the deep learning framework is used in model predictive control (MPC), which allows the autonomous vehicle navigate effi ciently and safely. We test the feasibility of our approach in publicly available NGSIM datasets. Proposed method shows a driving style similar to the human one and considers the safety related to the rules through the future prediction of the surrounding vehicles. I. INTRODUCTION Research on autonomous vehicles has been grown ex- ponentially in the past few years. Recently, Waymos au- tonomous cars have successfully driven 8 million miles on public roads. Still, one of the greatest challenges to build fully autonomous cars is the understanding dynamic driving scene. Especially, the prediction of surrounding vehicles is diffi cult to achieve, since drivers have different driving patterns and are dependent on surrounding vehicles move- ment. In addition, the decision on how to drive based on the understanding of the surrounding environment remains a problem; Autonomous vehicles must be controlled taking into account the movements of surrounding vehicles and safety problems, such as traffi c regulations. Unlike other robot applications, rule information acts as a critical role in autonomous driving problem. It is important to observe that rules do not have the same importance and that there are rules that, depending on the situation, should be prioritized or, in some cases, ignored. For example, in autonomous driving, there may be a situation in which some rules must be disobeyed, such as changing lanes behind K. Cho, T. Ha, G. Lee, and S. Oh are with the Department of Electrical and Computer Engineering and ASRI, Seoul National Uni- versity, Seoul 08826, Korea (e-mail: kyunghoon.cho, timothy.ha, gun- min.leerllab.snu.ac.kr, songhwaisnu.ac.kr). This work was supported in part by Institute of Information From the predicted margin to satisfaction of each rule, the controller can fi gure out which rules to follow, rather than maintaining strict compliance with all rules. The predicted trajectories of the surrounding vehicles are also considered in the control procedure so that the autonomous driving vehicle can be operated safely. II. RELATEDWORK Path prediction. Due to advance in deep learning method, research on trajectory prediction has been extensively stud- 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Macau, China, November 4-8, 2019 978-1-7281-4003-2/19/$31.00 2019 IEEE2076 ied. Exploiting the power of RNNs for sequence-to-sequence modeling, there are many studies that predict future tra- jectories from history trajectories. A RNN encoder-decoder framework with conditional variational autoencoder (CVAE) has been applied for trajectory prediction in 5, 6. They are structurally similar to our method, but we additionally considered interaction between multiple vehicles. The in- teraction between humans has been considered in 79; They focused on how to effi ciently capture dependencies between multiple pedestrians. Although we consider that the interaction is similar to the study, we focus on capturing the dependencies between vehicles in autonomous driving scenarios and, based on this, we ensure that the correct control can be generated. Control under temporal logic specifi cations. Computing optimal control under temporal logic specifi cations has been considered in the context of linear temporal logic (LTL) and signal temporal logic (STL). Mixed-integer linear program- ming was proposed to generate trajectories for continuous systems with fi nite-horizon LTL specifi cations in 1012. Also sampling-based motion planning methods have been successfully fi nd an optimal trajectory meeting a given LTL specifi cation 13, 14. The MPC scheme has been applied to signal temporal logic (STL) 15, 16. MPC with STL specifi cations is formulated as mixed integer linear program- ming in 15, and authors in 16 use probabilistic predicates to reason about safety under uncertainty. III. PRELIMINARIES A. Vehicle Model We let the state of the system at time t be xt= xt,yt,t,vtT, where xt, ytdenote the position of the vehicle, tis the heading, and vtis the linear velocity. Further, the control inputs of the system is ut= wt,atT, where wtis the angular velocity, and atis the acceleration. The dynamic of the vehicle is defi ned as follows: xt= vtcos(t), yt= vtsin(t), t= vt1wt, vt= 2at, where 1,2are constants. For a fi xed horizon H, let x(xn,uH,n) be a generated trajectory starting from xnwith control inputs uH,n= un,.,un+H1. A signal is a sequence of states and controls, which is defi ned as: (xn,uH,n) = (xn,un),.,(xn+H1,un+H1).(1) With a slight abuse of notation, (n) is a signal starting from time step n. B. Signal Temporal Logic Signal temporal logic (STL) is a logical formalism which is able to specify the properties of real-value, dense-time signals 3, 4. A STL formula is a composition of boolean and temporal operations on the defi ned predicates. The notation (,t) ? denotes that a signal satisfi es the STL formula at time t. For example, (,t) ? Ga,b means that holds for the signal between t + a and t + b. One great advantage of STL is that it is provided with a metric called robustness degree that measures how well a given signal satisfi es a STL formula . The robustness degree can be defi ned as a real-valued function of signal and t. We apply a notation (,t) ? (,r) representing that the signal satisfi es the STL formula at time t with robustness slackness r, which can be defi ned as follows 2: (,t) ? (,r)(,t) r.(2) Eqn (2) states that the signal satisfi es at least with the minimum robustness degree r. The robustness slackness r acts as a margin to satisfaction of STL formula . As r increases, strong constraints for the signal to satisfy at time t are given, while relaxed constraints are granted for small r. Especially, when r 0, it allows violation of . 5 different rules are defi ned with the corresponding STL formulas = ? 1,.,5 ?, which have some meaning in autonomous driving situations. 1) Lane keeping (right): 2= yt yl,min 2) Lane keeping (left): 1= yt yl,max 3) Collision avoidance (Front vehicle): 3= (xt xc,min)(xt xc,max)(yt yc,min) (yt yc,max) 4) Speed limit: 4= vt vmax 5) Slow down before the front vehicle: 5= (vt vth)Uta,tb(xt xc,min) Also we let cbe the collision between vehicles beside the front vehicle. We denote robustness slackness of the STL rules as r. IV. PROBLEMDEFINITION Our objective is to fi nd a control sequence of some fi nite time horizon under the pre-defi ned rules in an autonomous driving environment. In order to compute control sequence of the autonomous vehicle, our approach requires two following terms: (a) future trajectories of other vehicles, (b) robustness slackness of each rule in . The fi rst term directly related to the safety of autonomous vehicle, and the second term helps the controller to imitate the behavior of human experts by knowing the lower bound of degrees of satisfaction for each rule. We assume that six nearby vehicles that are in adjacent lanes affect the control of the ego vehicle (Figure 1). Notation for vehicles on track and input feature are shown in Figure 1. We mark the vehicle to be controlled as Vego. The nearby ve- hicles are denoted as Vnear= Vlf,Vlr,Vcf,Vcr,Vrf,Vrr. From the rest of paper, we use the subscript ego for the ego vehicle, and lf,lr,cf,cr,rf,rr are used for the sur- rounding vehicles. A feature representation fegoconsists of distances to nearby vehicles Vnearand lane deviate distance fego= (dlf,dlr,dcf,dcr,drf,drr,ddev). Let X = stHX,.,st and Y= st+1,.,st+HY be previous and future trajectory , where st R3is the position xt,yt, heading tat time t and HX,HYare time horizons. regoand rnear= rlr,rlf,.,rrf refer robustness slackness of the STL formulas for the ego vehicle and the surrounding vehicles. V. PROPOSEDMETHOD The proposed method consists of four modules (Figure 2); encoder module, interaction module, prediction module and control module. Through the deep neural networks including encoder module, interaction module and prediction module, we jointly reason about the future trajectories and robustness slackness of STL formulas for both the ego vehicle 2077 Fig. 1: Vehicle and feature descriptions in the track driving scenarios with respect to the ego vehicle (blue). Up to six nearby vehicles are considered; The front and rear vehicles are considered, which are in the left, middle and right lanes with respect to the ego vehicle Vego. Vegoand near vehicles Vnear. Since our model makes joint predictions for vehicles Vego,Vnear, it has the capacity to model interactions between vehicles. From predicted robust- ness slackness and trajectories, the ego vehicle is controlled through MPC procedure under the rule constraints . Instead of the strict satisfaction of rule restrictions, the designed controller is able to decide which rules should be prioritized and how satisfi ed they should be depending on the situation. We choose to use long short-term memory units (LSTMs) 17 as the RNN in both encoder and decoder module. Next we will detail these four modules sequentially. A. Encoder Module The encoder module is designed to model pattern of vehicles, including different history path and direction. The history of trajectories for each selected vehicles are encoded through LSTM networks. As input to the encoder network, difference values are used for each points of the trajectory. For each input trajectory, single layer multi-layer perceptron (MLP) to get fi xed length vector . These embeddings are used as input to the LSTM cell of the encoder at time t introducing the following recurrence: et= (xt,yt,t;Wemb) ht= LSTM(ht1,et;Wenc) where () is an embedding function with tanh nonlinearity, Wembis the embedding weight. The parameters of LSTM are shared between both the ego vehicle and near vehicles. Notice that future trajectories are also encoded during the training procedure, and the encoded future trajectories are used in the interaction module. The encoder module returns LSTM state htfor each vehicle, and We will notate the output of the encoder module for each vehicle as follows: hx for history trajectories and hy for future trajectories. B. Interaction Module Based on the outputs of the encoder module, the inter- action module measures joint information between vehicles. We focused on two terms; The fi rst is to consider the diversity that can occur in the current situation (multimodality), and second, to exclude scenarios that can not occur (robustness slackness). As we approach from these two perspectives, we predict futures multimodally in a given situation and exclude invalid predictions among the various predictions based on robustness slackness. The interaction module composed of two components which will be described in the following sections. 1) Diverse sample generation: Since future prediction is inherently ambiguous, there are various interpretations of the present driving situation. In order to capture the ambiguity and multiple plausible scenarios, we use CVAE structure which is able to explain uncertainties and generate plausible scenarios, which is to learn a distribution of p(Y |X,fego). The CVAE framework introduces a latent variable z so that p(Y |X,fego) = R zp(Y |X,z)p(z|X,fego)dz. The latent variable z models inherent structure in the interaction of multiple vehicles, and it also helps to describe underlying ambiguity of future behaviors of other vehicles. The CVAE structure composed of the three neural net- works, which are generation network pg(Y |X,z), recogni- tion network qr(z|Y,X,fego) and (conditional) prior net- work pp(z|X,fego), where g,r,pare network parame- ters. qr(z|Y,X,fego) is trained to give higher probability to z that is likely to produce of prediction Y close to the ground truth Y . While at test time, z is sampled randomly from the prior distribution pp(z|X,fego) and transferred to the prediction module for generating a prediction candidate. Such a structure enables probabilistic inference which serves to handle multi-modalities in trajectory prediction. In this work, the distribution of the latent variable z is modeled as Gaussian distribution: z qr(z|Y,X,fego) = N(z,z), where z,zare outputs of fully-connected layers. Since back-propagation is not feasible through random sampling, reparameterization trick 18 is used to train the CVAE struc- ture. The KL divergence loss is considered for regularization: lKLD= DKL(qr(z|Y,X,fego)|pp(z|X,fego). 2) Concatenating layer: Apart from diver sample gener- ation, we have additionally examined how well each vehicle conforms to the rules . For this purpose, we have bundled the encoded information of adjacent vehicles for each ve- hicle. Concatenating layer plays the role described above, which is implemented in several MLPs, and returns con- catenated states c = cego,clf,clr,ccf,ccr,crf,crr (Figure 2). cegouses all encoded states of near vehicles Vnear, and other terms in c beside cegoonly use three spatially adjacent encoded states. Notice that we do not apply interaction models presented in 79 because we only consider up to 6 near vehicles and the previous works are suitable for predicting pedestrians in crowded scenes. C. Prediction Module Prediction module predict both future trajectories Y and robustness slackness r for each vehicle by incorporating output of encoder module and interaction module. For given history trajectory X and the sampled latent variable z, the LSTM decoder generates future trajectory with respect to the distribution p(Y |X,z) = QHY i=1p(st+i|X,z,s1:i1), where Y = st+1,.,st+HY. Notice that we model outputs of both trajectory and rule prediction parts as a Gaussian mixture model (GMM) to better represent the distribution of real data, and the input of the next LSTM cell is sampled from the GMM. During the test phase, Kpprediction candidates are gen- erated from the prior distribution z(k) pp(z|X,fego), i.e.,Y (1),., Y (Kp). We measure the robustness degree of 2078 Fig. 2: Overall procedure for the proposed deep predictive control framework. (a, b) Past trajectories and future trajectories of vehicles are encoded in encoder module. Encoded information are feed into the interaction module, which captures the current situation. Interaction module produces stochastic latent variable z and concatenated states c for each variables. The latent variable z helps to jointly predict plausible trajectories of vehicles, and concatenated states c are for predicting rule information of each vehicle, which is the robustness slackness of STL formulas. (c) Combined with encoded states and sampled latent variable z, joint prediction on future trajectories are conducted. With predicted robustness slackness of STL formula, we can fi lter out invalid prediction candidates. (d) Control module fi nds out the valid control from the predictions through optimization process. each prediction candidates and fi lter out invalid ones which have different value than the predicted robustness slackness r. This process helps us achieve more accurate results and allows the controller to increase the safety of the control. D. Control Module In the control procedure, model predictive control (MPC) under signal temporal logic 2 is conducted. The cost func- tion J is designed as quadratic so that the above optimization is now mixed integer quadratic programming.: J(x(xt,uH,t),uH,t) = 1 2(xt+H xg)|Qx(xt+H xg) + uH,t |Q uuH,t. The goal point xgis selected as the last state of the predicted trajectory Yego and modifi ed to the middle point of the lane. Since the rule 5is hard to be computed, we neglect the constraints related to 5during the optimization process when the predicted robustness slackness rego(5) is below the certain threshold. E. Implementation Details We select an arbitrary vehicle as Vegoand set its surround- ing vehicles as Vnear. Three modules including encoder mod- ule, interaction module and prediction module are trained with the following loss function: ltrain= lKLD+ 1 K K X k=1 X iVego,Vnear lk,i Y + lk,i r , where K is a sample number of latent variable z, lk,i Y and lk,i r refer negative log-likelihoods of GMM in both trajectory and robustness slackness prediction. During training phase in the LSTM decoder in prediction module with a rate of 10% the predicted value as the input into the next cell, otherwise the ground truth value from the training data. VI. EXPERIMENTALRESULTS We conducted a reali

人人文库> 全部分类> 教育资料 > 课件下载

温馨提示

1. 本站所有资源如无特殊说明，都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
2. 本站的文档不包含任何第三方提供的附件图纸等，如果需要附件，请联系上传者。文件的所有权益归上传用户所有。
3. 本站RAR压缩包中若带图纸，网页内容里面会有图纸预览，若没有图纸预览就没有图纸。
4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
5. 人人文库网仅提供信息存储空间，仅对用户上传内容的表现方式做保护处理，对用户上传分享的文档内容本身不做任何修改或编辑，并不能对任何下载内容负责。
6. 下载文件中如有侵权或不适当内容，请与我们联系，我们立即纠正。
7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

IROS2019国际学术会议论文集2013

文档简介

温馨提示

最新文档

评论

IROS2019国际学术会议论文集2013

文档简介

温馨提示

最新文档

评论

相关文档