版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、Towards Reversible Dynamic Movement Primitives Iigo Iturrate1, 2, Christoffer Sloth1, Alja Kramberger1, Henrik Gordon Petersen1, Esben Hallundbk stergaard2, and Thiusius Rajeeth Savarimuthu1 1Maersk McKinney Moller Institute, University of Southern Denmark, Odense, Denmark inju, chsl, alk, hgp, trsm
2、mmi.sdu.dk 2Universal Robots A/S, Odense, Denmark iit, esbenuniversal- AbstractIn this paper we present an initial approach towards reversible robot movement primitives. Our approach is a modifi cation of Dynamic Movement Primitives (DMPs), a widely used framework for robot learning from demonstrati
3、on. DMPs are based on dynamical systems to guarantee properties such as convergence to a goal state, robustness to perturbation, and the ability to generalize to other goal states. Yet a main limitation of their original formulation is that they do not allow for movements to be reversed. Thus, to ex
4、ecute the same task forwards and backwards would mean to learn two separate primitives. We propose to replace the transformation system in DMPs with the Logistic Differential Equation (LDE), a known time-reversible non-linear system. Similarly to the original DMP formulation, our systems temporal ev
5、olution is controlled by a phase system, which in our case is derived from the LDE to guarantee reversibility. We evaluate our approach experimentally with demonstration data from a real robot assembly task, and show comparable properties to those of the original DMP system. I. INTRODUCTION The past
6、 decade has seen a growing trend towards robot automation in industrial manufacturing in order to lower production costs. At the same time, the number of Small- and-Medium Enterprises offering customized small batch production has risen. Both of these trends have been enabled by the adoption of chea
7、p and easy-to-use automation solu- tions, particularly in the form of collaborative robots. These robots make use of intuitive graphical user interfaces and hand-guided via-point teaching to allow non-expert users to quickly develop robot applications. While collaborative robots have lowered the ent
8、ry barrier for industrial automation, there is a strong push for methods that are even more user-friendly. One such method is pro- gramming by demonstration, by which a user shows the robot how to perform a task either by performing it themselves, by guiding the robot kinesthetically 1 or by other m
9、ethods 2 and the robot learns the corresponding skill. Within programming by demonstration, Dynamic Move- ment Primitives (DMPs) 3 is one of the most popular frameworks for learning skills. DMPs can be learned from a single demonstration, as opposed to the larger datasets required by statistical met
10、hods. This has practical value in cases where quick task setup is important, such as industrial applications with small batch sizes and quick changeover requirements. With a basis in the neuroscientifi c domain 4, initial work with DMPs in robotics focused on imitation learning 5, 3, 6 and was later
11、 utilized in several other robotic domains, such as humanoid bipedal locomotion 7, adaptive frequency modulation for periodic movements 8, 9, reinforcement learning of task parameters 10, 11, and generation of motor skill libraries 12, 13, 14 for automatic trajectory generation. One of the advantage
12、s of this framework is that movement primitives can be easily generalized to new goal confi gu- rations 15 and the speed of execution can be changed. Furthermore, because their formulation is grounded in dy- namical systems theory, DMPs can be extended to exhibit more complex behaviors, such as obst
13、acle avoidance 16 or hybrid position-force control 17. All of these characteristics make them versatile and applicable in a variety of contexts and tasks. In recent work, the area of robotic assembly has gained attention. Here, the goal is to not only consider the kinematic trajectory, but also the
14、measured forces and torques arising during task execution 18 to adapt the trajectory to fulfi ll the task requirements. For safe interaction with the environment, force feedback can directly be coupled within the DMP on the velocity and acceleration levels 19. When incorporating compliant coupling t
15、erms, the overall stability of the system must be considered 20 to ensure a safe execution. Within industrial assembly, reversibility of the motions is also of general interest. In a fi rst use-case, a reversible motion representation could be used to both assemble and disassem- ble a workpiece, or
16、palletize and depalletize. Perhaps more importantly, reversibility has proven to be an effective error- recovery strategy for a high percentage of tasks 21, 22. Here, if the robot contact forces exceed the nominal values expected for the task, the robot can backtrack to a previous point in the assem
17、bly trajectory and try again. Because of the inherent sensor uncertainty in the assembly of low-tolerance parts, this retrial strategy will often succeed 21. Dynamic Movement Primitives are unable to reverse the trajectory, because the second-order spring-damper system becomes unstable in the backwa
18、rds direction. This means that it is necessary to learn separate primitives for the forwards and backwards directions. To the extent of our knowledge, this issue has only been discussed by Nemec et al. 23. In their work, the user is able to incrementally teach a correction term that scales the demon
19、stration trajectory in time, including the possibility of reversing it. However, to 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Macau, China, November 4-8, 2019 978-1-7281-4003-2/19/$31.00 2019 IEEE5063 allow backwards execution, the authors switch to a second DMP
20、 learned from the backwards trajectory, in order to avoid instability. In this paper we present a reformulation of Dynamic Movement Primitives that uses the time-symmetrical prop- erties of the Logistic Differential Equation as a basis for constructing a reversible DMP system. In doing so, we attemp
21、t to preserve the main properties of traditional DMPs. This manuscript is structured as follows: First we review the original formulation of DMPs and formalize their main properties as suggested in the literature in section II. We then formalize the requirements for our defi nition of reversibility
22、in section III. Upon this, we present our suggested approach towards constructing reversible DMPs from the Logistic Differential Equation in section IV. Section V then analyzes our system with respect to reversibility and stability, and the properties of DMPs. Our method is then evaluated experi- me
23、ntally in both simulation and on a real robot in section VI, followed by a discussion on its strengths and limitations in section VII. We conclude this work in section VIII. II. DYNAMICMOVEMENTPRIMITIVES Dynamic Movement Primitives are mathematically for- malized as a set of transformation systems (
24、1), one per output dimension, which control the evolution of the trajectory, and a canonical system (2), which controls temporal evolution of the system and provides coupling between dimensions: 2 y = y(y(g y) y) + f(x),(1) x = xx,(2) where g represents the goal state or attractor of the system, y,y
25、,x R+are positive gain constants and R+is a time constant. The term f(x), called the forcing term, in (1) is typically defi ned as follows: f(x) = PN i=1i(x)wi PN i=1i(x) x(g y0),(3) i(x) = exp (x ci)2 22 ! ,(4) where N Z is the number of Gaussian basis functions, i (4), with centers ci, variance ,
26、and weights wi. The constant y0in (3) corresponds to the initial state of the transformation system (1). If the parameters are selected as 0,y= 4 0, the system is critically damped and has a unique point attractor at y = g. Notice that, excluding the f(x), (1) corresponds to a second-order spring-da
27、mper system. Term f(x) introduces non-linear dynamics to the system and can be learned to gen- erate any arbitrary smooth desired trajectory. Furthermore, the system (1) is not explicitly dependent on time, but is instead dependent on a phase variable, x. Because x appears in (3), it will eventually
28、 gate out the infl uence of f(x) in (1) as it converges to zero, thus guaranteeing convergence of the transformation system to its attractor state g. Since f(x) is linear in its parameters wi, learning a desired trajectory can be done by linear regression. In this case, given a demonstration traject
29、ory, ydemo, ydemo, ydemo, a desired forcing term that results in the demonstration movement can be calculated according to: fdesired= 2 ydemo y(y(g ydemo) ydemo)(5) From (5), a least-squares weighted linear regression prob- lem can be formulated in order to approximate fdesiredusing f, as defi ned i
30、n (3). Details on the parameter learning can be found in 3. III. REQUIREMENTS FOR THEREVERSIBILITY OFDMPS In order to formulate a reversible DMP, we must fi rst defi ne a set of design requirements needed to obtain the desired behavior. First of all, a DMP must be globally asymptotically stable. The
31、 basic minimum requirement for it to be globally stable in both the forwards and backwards directions is that the underlying dynamical system must have two critical points: one at the goal state, which must be an attractor in the forwards direction, and another at the start state, which must be an a
32、ttractor in the backwards direction. Furthermore, it would be preferable if the underlying dynamics of the transformation system were reversible. This would mean that the systems trajectory would be identical in both directions, allowing us to use the same forcing term for backwards execution as use
33、d for forwards execution. A dynamical system is reversible if there exists an involution in phase space which reverses the direction of time 24. Specifi cally, let x Rnand f : Rn Rn, then dx dt = f(x)(6) is reversible if there exists a map G : Rn Rnwhere G G = Id, such that dG(x) dt = f(G(x).(7) Not
34、ice that according to this defi nition, the existence of an attractor in a dynamical system implies the existence of a repeller in the corresponding reverse time system. It is therefore impossible to have global stability and reversibility simultaneously. Instead, either of the requirements must be
35、relaxed. A. The Logistic Differential Equation We now consider a dynamical system which fulfi lls the re- quirements of having two critical points and being reversible as per (6) and (7). An example of such a system is the logistic differential equation (LDE) 25: y = (y a)(b y),(8) which is reversib
36、le since the involution G : y 7 a + b y reverses the direction of time for the system where a 6= b R. As previously stated, this system has an attractor at the goal, b, and a repeller at the starting state, a for the forwards system, and the points switch roles for the reverse system. 5064 This impl
37、ies that the system is not globally asymptotically stable (a stable alternative will be introduced in section III- B). Consider the logistic differential equation (8), and let a b, then the region of attraction of y = b is (,a) R and likewise for the reverse system the region of attraction of y = a
38、is (b,) R. We can now re-write the system (8) in a form more similar to that of DMPs (1) and (2), with a goal state g and start state y0, and with positive gain R+as follows: y = (g y)(y y0).(9) However, this parametrization based on is not ideal, because we would like to be able to modulate speed a
39、nd direction of execution simultaneously and smoothly. With , smooth modulation of speed would imply a discontinuity in , as y 0 when . With this in mind, we redefi ne the system as: y = fLDE(y) = (g y)(y y0),(10) where R can be interpreted as = 1 , i.e. a speed- scaling instead of time-scaling cons
40、tant, such that positive values of move the system forward, negative values move it backward, and a value of zero halts it. From now on, and for the rest of this paper, we will assume y0 g. The regions of stability of (10) can then be written as: Rforward= y R | 0, y = g + (1 )y0,(11) Rreverse= y R
41、| 1, y = g + (1 )y0,(12) for the forwards and backwards systems, respectively. A plot of the function (10) can be seen in Fig. 1. -101234567 -3 -2 -1 0 1 2 3 Fig. 1: Graph of the generalized logistic equation (10) with g = 5 and y0= 1. Notice that the system is time-reversible, i.e. yreverse= yforwa
42、rdand yreverse= yforward. B. The Stabilized Logistic Differential Equation In order to satisfy the requirements that a DMP should be globally asymptotically stable towards the attractor g, we propose to stabilize the LDE by reversing the sign of y in the unstable region. Thus, we defi ne the Stabili
43、zed LDE (SLDE) system piecewise as follows: yforward= ( fLDE(y)if y Rforward fLDE(y)otherwise, (13) yreverse= ( fLDE(y)if y Rreverse fLDE(y)otherwise, (14) fSLDE(y,) = ( yforwardif 0 yreverseotherwise. (15) Note that the system is now globally stable. However, stabilizing the system has in turn resu
44、lted in limiting its reversibility, such that it is only reversible for y (y0,g), as shown in Fig. 2. As stated earlier, this compromise is nec- essary if global stability and reversibility are simultaneously needed, as reversibility precludes global stability. -101234567 -3 -2 -1 0 1 2 3 Fig. 2: Gr
45、aph of the stabilized generalized logistic equation (10) with g = 5 and y0= 1. Notice that the system is now only time-reversible for y (y0,g). IV. REVERSIBLEDYNAMICMOVEMENTPRIMITIVES With the requirements and considerations regarding re- versibility established in section III as a basis, we now ref
46、ormulate DMPs using the SLDE. Based on (15), we reformulate the transformation system: y = fSLDE(y,) + p(16) p = f(z)(17) The new phase system, renamed as z to account for the different dynamics, is given by a logistic differential equation with y0= 1 and g = 1: z = z(1 z)(z + 1),(18) where gain con
47、stant z R+controls the speed of conver- gence. The new forcing term f(z) is defi ned as follows: f(z) = PN i=1i(z)wi PN i=1i(z) v(z) (g y0).(19) The weighted Gaussian basis functions are defi ned as in (4), but as a function of z instead of x. We introduce a separate gating term given by v : 1,1 R a
48、s: v(z) = ?1 z2m?1 2m ,(20) 5065 with m R+. Note that this choice of gating system solves a drawback of the term defi ned in (2), namely that the weights, wiof the forcing term will take large values towards the end of the trajectory, due to the gating terms convergence to zero. An alternative gatin
49、g system that also solves the scaling problem of wihas been presented in 26. V. SYSTEMANALYSIS A. Partial vs. Complete Reversibility As mentioned in section III-B, making the system globally stable limits its reversibility to the region y (y0,g). Outside this, the dynamics are not reversible. Due to
50、 this, a compromise must be reached depending on the properties desired from the system. Notably, there are, in theory, two main alternatives with our system: 1) If the system must be globally stable, a separate forcing term must be learned for the forwards and reverse trajectories, to account for t
51、he non-reversibility in regions outside of y (y0 ,g). Thus, (17) is redefi ned as: p = ( fforward(z)if 0 freverse(z)otherwise, (21) where fforward(z) and freverse(z) are forcing terms de- scribed according to (19) and trained on the forwards and backwards executions of the demonstration trajec- tory
52、, respectively. 2) If global stability is not a concern, or if the system always operates within y (y0,g), the non-stabilized generalized logistic equation in (10) can be used as the foundation of the transformation system. Note that perturbing the system beyond g or y0could be po- tentially dangero
53、us, as it would then become unstable with fi nite escape time. We therefore do not consider this a practical solution, and will not discuss it for the remainder of this work. B. Properties with Respect to Dynamic Movement Primitives As formulated by Ijspeert et al. 3, the design of a point to point
54、discrete (i.e. non-rhythmic) DMP is based on a series of design principles that defi ne the properties that the system should follow. Thus, when formulating Reversible DMPs, we took these principles into consideration. We now briefl y outline how our approach relates to this. The original principles
55、 of DMPs 3 are stated in bold, and then followed by their equivalent in the Reversible DMP system: 1) Guaranteed congergence to a point attractor (the goal g): Reversible DMPs have a stable global attractor at the goal g. However, in order for the system to be reversible, the initial state y0must al
56、so be a critical point of the system (see section III), as the points will switch role in the reversed system. In our stabilized LDE-based system, due to the switched dynamics, y0 is a saddle point. This is theoretically problematic, because the systems evolution would halt if it were to reach this
57、point with zero velocity and because, initially, it would never be able to escape the start state. In practice, the forcing term will push the system away from y0, such that the above will not be an issue, without any modifi cation to the target trajectory path. Furthermore, in a real computer syste
58、m, residuals from fl oating-point approximation will make it very unlikely that the system will get stuck in the saddle point. 2) The system should be autonomous, without explicit time dependence and allow for time modulation, i.e. speed-up or slow-down: Since the generalized stabilized LDE on which
59、 our system is based is au- tonomous, and the rest of the terms forcing term and phase are formulated in a way that is analogous to the original DMP system and thus also autonomous, this property is retained. 3) Multi dimensional dynamical systems need to be coordinated in a stable way: Similarly to the original DMP formulation (2), this is done through the phase system (18). 4) Open parameters should be easy to learn in order to reproduce the desired movement, i.e. the model should be linear in the para
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2026年全国一级注册建筑师之建筑结构考试黑金试卷详细参考解析
- 初中八年级地理《矿产资源与区域可持续发展》教案
- 本科一年级“学业规划与个人发展”课程《学习历程梳理与发展蓝图构建》教案
- 八年级数学(苏科版)上册实数全章知识清单【核心版】
- 八年级道德与法治《国家司法机关:守护公平正义的最后防线》教学设计
- 西北农林科技大学2026年强基计划面试+体育测试模拟试题及答案解析
- 2026年物流主管竞聘笔试题集
- 2026年法律顾问资格考试仿真题精
- 2026年自然灾害应对指南题解
- 2025辽宁营口港口公共基础设施资产管理有限公司面向市属国有企业公开选聘2人笔试历年参考题库附带答案详解
- 产品设计制图与图纸标准化手册
- GCP法规培训教学课件
- 呼吸阀阻火器培训课件
- 移动通信基站维护工作标准流程
- 2025版NOF防治骨质疏松症临床指南解读
- 2025辽宁锦州海通实业有限公司社会招聘7人笔试考试参考试题及答案解析
- 八项硬措施解读课件
- T-BJQC 202301-2023 智能座舱界面与人机交互主观评价技术规范
- DB52-T 1777-2023 民用醇基液体燃料安全技术规程
- 2025年大学《监狱学-狱政管理学》考试备考题库及答案解析
- 项目经理讲安全课课件
评论
0/150
提交评论