IROS2019国际学术会议论文集0832_第1页
IROS2019国际学术会议论文集0832_第2页
IROS2019国际学术会议论文集0832_第3页
IROS2019国际学术会议论文集0832_第4页
IROS2019国际学术会议论文集0832_第5页
已阅读5页,还剩2页未读 继续免费阅读

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

1、Fast Motion Planning via Free C-space Estimation Based on Deep Neural Network Xiang Li1, Qixin Cao2, Mingjing Sun3, Ganggang Yang4 AbstractThispaperpresentsanovellearning-based method for fast motion planning in high-dimensional spaces. A deep neural network is designed to predict the free con- fi g

2、uration space rapidly given the environment point cloud. With a generated roadmap as an approximate view of the free C-space, LazyPRM is applied to fi nd and check the path with A* search. Due to the application of LazyPRM, the presented method can preserve probabilistic completeness and asymptotic

3、optimality. The new algorithm is tested on a 3-DOF robot arm and a 6-DOF UR3 robot to plan in randomly generated obstacle environments. Results indicate that compared to planners including PRM, RRT*, RRT-connect and the original LazyPRM, our method is of the lowest time consumption and relatively sh

4、ort path length, showing good performance on both planning speed and path quality. I. INTRODUCTION Motion planning algorithms aim to generate collision-free motions for complex bodies from a start to a goal position among a collection of static obstacles 1. These algorithms are widely used in robot

5、navigation, robotic surgery and robotic manipulation. Sampling-based planners are very popular methods for solving motion planning problems and can guarantee proba- bilistic completeness and asymptotic optimality. The acceler- ation of sampling-based motion planning is one optimization direction tha

6、t researchers focus on 2 3 4 . Faster planning speed can be benefi cial for variable environments, specifi cally for robots working in changing environments or robot arms on a moving platform that can change their positions. Recent work shows that using environment data to set up a heuristic single-

7、query planner can increase planning speed 5. The key idea of sampling-based planners is to exploit the free C-space by sampling and collision checking between the robot and the environment. This is because an explicit representation of the obstacles in the confi guration space may result in an exces

8、sive computational burden in high dimen- sions, and in environments described by a large number of obstacles 6. Neural networks can perform well when fi tting complex functions and might be helpful to predict the free C-space with the information of the obstacles. This paper presents a novel learnin

9、g-based method to accelerate sampling-based motion planning. A deep neural network is designed to estimate the free C-space from the point cloud of an environment. Figure 1 shows a prediction 1,3,4 MasterstudentsofSchoolofMechanicalEngineering, ShanghaiJiaoTongU, smj 1024, 2 Professor of School of M

10、echanical Engineering, Shanghai Jiao Tong U - 3 0 2 0 0 1 - - - 3 0 2 0 0 1 - - Fig. 1.Free C-space predicted by deep neural network for a 3-DOF robot arm in an obstacle environment. Left is the ground truth roadmap vertices and on the right is the prediction. The input of the network is the point c

11、loud of the environment and the output is a set of robot confi gurations indicating the C-free. of the free C-space of a 3-DOF robot arm in an obstacle environment. PointNet 7 is adopted in our network for point cloud processing. A roadmap vertex set is generated directly by the DNN with Earth Movin

12、g Distance (EMD) as the loss of the network to maintain the unordered feature of the graph vertices. Then the output vertices are rewired, and paths are found by A* search algorithm. LazyPRM 8 is applied to reduce the number of collision checking and to ensure probabilistic completeness and asymptot

13、ic optimality. The roadmap need only to be generated once in each different environment, and can be used for multiple planning problems in the same environment, making our multi-query planner suitable for mobile manipulators where the robot arm often faces different environments and may need to perf

14、orm multiple tasks in the same environment. The algorithm is tested on a 3-DOF robot arm and a 6-DOF UR3 robot to plan in randomly generated obstacle environments. Results indicate that compared to planners including PRM, RRT*, RRT-connect and original LazyPRM from OMPL 9, averagely, with a well-gen

15、erated roadmap from the network, our algorithm can fi nd a relatively low- cost path for the planning problem in the least time. This paper begins with the background of sampling-based planning. Section II then shows the classical sampling-based planners and previous work for sampling-based fast pla

16、nning and learning-based planning. In Section IV we describe our method in detail. Experiments are shown in Section V. Finally Section VI concludes this paper. 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Macau, China, November 4-8, 2019 978-1-7281-4003-2/19/$31.00

17、 2019 IEEE3542 II. RELATED WORK A. Sampling-based motion planning Sampling-based planners are one of the main categories of robot motion planning algorithms. Instead of providing the exact geometric model of the C-space, sampling-based plan- ners try to describe the C-space by sampling and checking

18、whether these sampled robot confi gurations collide with the obstacles. In this way, sampling-based planners can solve the planning problem in a more computationally effi cient way. The exchange of the computational effi ciency is that sampling-based planners provide a weaker, but still interest- in

19、g, form of completeness: probabilistic completeness, which means if a solution path exists, the planner will eventually fi nd it 1. Sampling-based planners can be mainly divided into two classes: single-query planners and multi-query planners. Rapidly-exploring random tree (RRT) algorithm 10 is the

20、main representative of single-query planners. This algorithm solves a planning problem by trying to grow a tree from start confi guration to the goal. Its bidirectional derivative RRT-connect 11 is famous for its fast speed and another derivative RRT* 12 can solve motion planning problems with asymp

21、totic optimality. Multi-query planners can plan faster than single-query planners with a precise pre-generated roadmap, but lose this advantage in new environments. The probabilistic roadmap (PRM) method 13 is a representative of the multi-query planners. Its asymptotic optimal derivative is PRM* 12

22、. Another derivative of PRM, LazyPRM 8, can reduce the planning time using lazy collision checking. B. Fast motion planning Various research has been done to accelerate motion planning, mainly for robots working in unstable environ- ments, specifi cally, a robot in an environment with moving obstacl

23、es or the robot arm is being moved, for example, a mobile manipulator. One main approach was to modify the algorithms to re-use information to effi ciently update motion paths when the environment changes 14. Paper 2 presented Dynamic Roadmap (DRM) algorithm, which accelerates planning speed by work

24、space discretization and using pre-generated collision checking data, and was able to do real-time planning in changing environments 3. How- ever, this algorithm cannot guarantee completeness because the workspace needs to be discretized into cells for fast collision checking, and will face diffi cu

25、lty while approving accuracy since the number of cells to check increases with the reciprocal of the third power of cell size. The method in 4 proposed another solution that can guarantee probabilistic completeness. LazyPRM was adopted as the planner of this method planning with a pre-generated road

26、map and online estimation of vertex collide possibility. But collision checking results in the same environment are needed to set up a precise model, and the prediction model can lose effectiveness in a new environment. Recent years, machine learning technologies have been widely concerned in roboti

27、cs and were adopted to accelerate robot motion planning. Paper 15 tried to optimize the sampler by clustering-based unsupervised region identifi ca- tion with a small roadmap and was able to apply different sampling strategies according to the feature of the region. Another method, Lightning Framewo

28、rk 16, tried to re- duce computation time by learning from experience. This framework consists of two main modules running in parallel, a planning-from-scratch module, and a module retrieves and repairs paths stored in a path library. However, the path library may not perform well for new environmen

29、ts where obstacles change a lot. Recent work in paper 17 used conditional variational autoencoder to accelerate the planning process by non-uniform sampling strategies that favor sampling in regions where an optimal solution might lie. The network is trained for a specifi c environment so that this

30、method is not able to address motion planning problems in different environments. Another recent work, 5, presented a deeply informed approach for the sampler to predict the well-optimized results generated by RRT* with the data of the environment given in the form of a point cloud. However, the dee

31、p sampler can only produce one sample under each condition, leading to running neural network forward propagation multiple times in solving one problem. Its performance is highly related to the training quality of the sampler; therefore, the performance of the whole algorithm is sensitive to the qua

32、lity of the dataset. Since RRT* is an asymptotic optimal planner and may cost much time to get well-optimized and each environment in the dataset need several generated paths, this algorithm might suffer from the diffi culty of dataset generation. III. PROBLEM FORMULATION The goal of this work is to

33、 fi nd a relatively low cost solution for motion planning problems in a short time in dif- ferent environments. The formulation of the motion-planning problem follows as presented by Karaman and Frazzoli 6. Let X = (0,1)d be the confi guration space, where d N, d 2. Let Xobsbe the obstacle region, s

34、uch that XXobsis an open set, and denote the obstacle-free space as Xfree= cl(XXobs), where cl() denotes the closure of a set. The initial condition xinitis an element of Xfree, and the goal region Xgoalis an open subset of Xfree. A path planning problem is defi ned by a triplet (Xfree,xinit,Xgoal).

35、 Problem 1 (Feasible path planning) Given a path plan- ning problem (Xfree,xinit,Xgoal ), fi nd a feasible path : 0,1 Xfreesuch that (0) = xinitand (1) cl(Xgoal), if one exists. If no such path exists, report failure. Let a set of all feasible paths be denoted as . Let a cost function c() computes a

36、 summation of Euclidean distances between all the consecutive states in a path . Problem 2 (Optimal path planning) Given a path planning problem (Xfree,xinit,Xgoal) and a cost function c : R0 , fi nd a feasible path such that c() = minc() : is feasible. If no such path exists, report failure. 3543 P

37、ointNet n x 3 1024 MLP m x d input points environment feature output vertices Fig. 2.Architecture of the roadmap generation network. n x 3 1 x 1024 input points PointNet 1 x1x64 CONV n x 64 1 x1x 64 CONV n x 64 1 x1x64 CONV n x 128 1 x1x1024 CONV n x 1024 max pooling environment feature n x 64 1 x1x

38、128 CONV Fig. 3.Architecture of PointNet (vanilla) 7. IV. ALGORITHM IMPLEMENTATION In this section, we present our algorithm, including a deep neural network that can generate C-space roadmap vertices for different environments to accelerate motion planning and a planner that can utilize the results

39、 of the roadmap gen- eration network (RGN). This section also includes specifi c information about the generation of training dataset. A. Network architecture Our network consists of two parts: the fi rst part to extract features of the environment point cloud and the second to generate vertices fro

40、m the features to build a C-space roadmap (Figure 2). In the fi rst part of our network, PointNet 7 is used to extract features of the environment point cloud that can be obtained from 3-D sensors (Figure 3). PointNet is an architecture that can effectively process unordered point sets by extracting

41、 point features through convolutional layers and aggregating information from all the points through a max- pooling layer. It is a simple but highly effi cient and effective architecture originally created for object classifi cation, part segmentation and scene semantic parsing. In order to use this

42、 architecture to solve motion planning problems, the simplest version of PointNet, the vanilla version (without point cloud transformation layers) is used for higher performance. This part of network consumes a n 3 data matrix where n is the number of input points and outputs a 11024 vector as the f

43、eature of the environment point cloud. The second part of the network aims to reconstruct the C- space roadmap from the feature of the environment point cloud. A multilayer perceptron (MLP) is applied in the structure with ReLU activation function. The layer number of the MLP will be further discuss

44、ed in later sections. Each fully-connected layer is followed by a dropout layer (Figure 4). The input of this part of the network is the 1 1024 feature vector generated by PointNet. The output is a m d matrix where d is the dimension of the C-space and m is the number of output roadmap vertices. 102

45、4 m x d environment feature output vertices 1024 FC Dropout 1024 FC Dropout m x d FC multilayer perceptron (MLP) Fig. 4.Architecture of the multilayer perceptron module. The Earth Movers Distance (EMD) 18 between the vertices of dataset roadmap and the reconstructed roadmap is adopted as the loss of

46、 the network. EMD is the solution of a transportation problem and is defi ned as the least cost that one set transforms to the other. For two equally sized subsets S1 Rd; S2 Rd , their EMD is defi ned by dEMD(S1,S2) =min :S1S2 X xS1 kx (x)k2 where is a bijection 19. The EMD distance is symmetrical a

47、mong all points in the set and can be easily adopted to high- dimensional spaces, which is suitable for our application. In our method, the distance between two confi gurations is set a weighted Euclidean distance with coeffi cients differ among joints. B. Dataset generation In order to train the pr

48、oposed network, the needed dataset includes different environment point clouds and the cor- responding roadmap vertices. The obstacle environment is generated randomly from four kinds of basic 3D objects (sphere, cube, ring and cylinder) (Figure 6). Points are only placed on the surface of the objec

49、ts to simulate real data collected from 3-D cameras. This can also be more compu- tational effi cient while enhancing accuracy because in this case number of voxels to check increases with the reciprocal of only the second power of voxel size. The number of each kind of object in one environment is

50、random and each object is transformed by a random homogeneous transformation matrix. Points too far away that will defi nitely not collide with the robot are removed. Then, for computation effi ciency, the point cloud is processed by a PCL VoxelGrid fi lter. After all the process, the number of poin

51、ts varies from approximately 5000 to 40000. Considering the balance of a smaller neural network for higher performance and enough points for roadmap prediction accuracy, 16384 (=214) points are selected randomly in each piece of data. Points may be selected repeatedly when there are not enough in on

52、e point cloud; our network can handle repeated points well since the same points will not affect the feature that came out from a max-pooling layer. After creating the obstacle point cloud, a roadmap in C- space is generated for this environment. As planning in open C-space regions is not the bottle

53、neck of motion planning, a novel algorithm is presented for roadmap dataset generation (Algorithm 1) to bias samples to complex regions. Our roadmap generator starts with an empty vertex set V and 3544 Algorithm 1: Roadmap dataset generator 1V ;E ;t 0; 2Start timing on t; 3while t sizeminthen 8conti

54、nue; 9else if SameComponent(Xconnectable) then 10continue; 11else if InterConnectable(Xconnectable) then 12continue; 13else 14AddV ertex(xrand,V,E) 15return V ; an empty edge set E, and generate time is limited for each environment. While not running out of time, a sampler fi rst tries to sample a v

55、alid robot confi guration xrand. Then a neighbor fi nding function Neighbor() would give the Nneighbornearest vertices of xrandin V , this result is stored in Xnear. Then, the direct path from xrandto every vertex in Xnearis tested by the collision checking func- tion CheckMotion() and connectable v

56、ertices are stored in Xconnectable. After this, three conditions are checked sequentially and xrandwould be added to the graph if any condition is satisfi ed. The order of the three judgements is determined by computational cost, from low to high. The fi rst condition is the vertex number in Xconnec

57、table. Small vertex number in Xconnectablemeans that many of near neighbors cannot be reached directly by xrand, the C-space near xrand is complexed and new vertices are needed. Checking the size of Xconnectableis quick since it is represented in one array in our algorithm. The second condition is w

58、hether all the vertices in Xconnectablebelong to the same component of the graph. Vertices that can reach each other through edges are defi ned in the same component. xrandwill be added to the graph if it can connect different components. The judgement of this condition would only need to check the

59、existing data. The third condition to check is whether all vertices in Xconnectablecan directly reach each other. This condition also indicates the region complexity near xrand. The computational cost is high because collision checking is needed in this process. When the time meets the limit, the program stops adding new confi gurations to the roadmap, all the vertices in V are stored together with the environment point cloud as a piece of data. C. Planner for the generated roadmap During online planning, the roadmap generation network generates a roadmap vertex set

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

评论

0/150

提交评论