付费下载
下载本文档
版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、Lesson content:IntroductionSimplifying the Model Parallel ExecutionTechniques for Reducing CPU TimeTipsLesson 9: Managing Large Models: Part 190 minutesIntroduction (1/2)What is a Large Model?Difficult to defineAbsolute size changes as computers e more powerful.Rough measures for large models based
2、on number of variables:1980: Models with 100,000 variables 1990: Models with 1,000,000 variables2000: Models with 5,000,000 variablesTurn-around time is a universal measureThis measure includes jobs with relatively few variables and a huge number of increments.Models that take at least 12 hours (ove
3、rnight) to run are generally considered “large.Introduction (2/2)Large models often require:Simplification of the model geometry, physics, element formulation, etc. Intelligent use of system resources:disk space, available processors, memoryMinimization of CPU time, number of increments, etc. These
4、techniques will be discussed in this lecture Several techniques are available to assist in the management of large models:Submodeling Restart Parts and assemblies These techniques will be discussed in the next lecture. Simplifying the Model (1/14)There are a number of techniques that can be used to
5、simplify a model:Geometry simplificationModeling fewer detailsRigid body prototypingSolid element formulation Small-strain shellsSimplifying the Model (2/14)Simplify geometryWhen building a complex model, start simple (and small), then add complexity gradually.Making careful choices regarding your g
6、eometry and mesh:Take advantage of symmetry axial, planar, quarter, half, cyclic,Defeature the geometryAvoid small features They result in small elements which in turn reduce the stable time increment.Defeaturing a mechanical componentSimplifying the Model (3/14)Model fewer detailsConsider what you
7、want from the analysis and reduce the level of modeling detail in regions that are less critical to the analysis goals.For example:If you have two parts connected to each other and if you are not interested in detailed stresses of the connection, use connector elements.Use beams or shells instead of
8、 solids when appropriate. Focus mesh in regions of interest and use a coarser mesh elsewhere.Simplify the physics. Can a stiff portion of an assembly be replaced with a rigid body or a boundary condition?Do you need all interactions?Simplifying the Model (4/14)Example: Simplified model of a sports b
9、all for use in an impact analysis.The ball consists of a solid core and a shell cover.The detailed behavior of the ball is not of interest.The ball is reduced to the following:Analytical rigid surface for ball exterior contact surfaceLumped mass and rotary inertia for coreLumped mass and rotary iner
10、tia for coverConnector elements for stiffness and damping between:Core and coverCover and rigid exterior surfacecorecoverSchematic of modeled ball with a quarter of the cover cutawayBall ModelSimplifying the Model (5/14)Prototype with rigid bodiesMake deformable parts rigid with a rigid body constra
11、int for faster initial runs of the analysis.Once you have verified the model definition, remove the rigid body constraints.Simplifying the Model (6/14)Example: Drop Test Simulation of a Cordless MouseRigid body constraints are used so that initial versions of the analyses run quickly.*RIGID BODY, EL
12、SET=RIGID-BASE, REF NODE=BASE-RP, POSITION=CENTER OF MASSMouse baseMouseRIGID-BASE(white elements)Deformable elements (red)Simplifying the Model (7/14)Example (contd): Drop Test Simulation of a Cordless MouseFinal deformed shapesDeformable regions in colorfully deformable mouserigid prototype mouseM
13、ouse type Portion of mouse that is deformableRelative CPU timerigid prototype23%0.35fully deformable100%1Simplifying the Model (8/14)Solid element formulationAbaqus/Explicit offers three alternative kinematic formulations for the C3D8R solid element:The average strain formulation (default)Most expen
14、sive; highest level of accuracyThe orthogonal formulation Provides a balance between computational speed and accuracy i.e., less expensive than the average strain formulation and more accurate than the centroid formulationThe centroid formulation Least expensive; lowest level of accuracyC3D8RSimplif
15、ying the Model (9/14)Average strain kinematic formulation for hexahedral solid elementsThe default average strain kinematic formulation for solid elements is based on the uniform strain operator and orthogonal hourglass shape vectors. The resulting elements:Pass the constant strain patch test for a
16、general configuration.Have zero strain under large rigid body rotation.The default kinematic formulation can be very expensive, especially for three-dimensional problems. Simplifying the Model (10/14)Orthogonal kinematic formulation for hexahedral solid elementsThe orthogonal formulation is based on
17、 the centroidal strain operator and a slight modification to the hourglass shape vectors. The centroidal strain operator requires 3 fewer floating point operations than the uniform strain operator. These elements pass the patch test only for rectangular or parallelepiped element configurations.Howev
18、er, they converge well for general element configurations as the mesh is refined. This formulation is mended for all analyses except those involving:highly distorted elements, very coarse meshes, or high confinement. Simplifying the Model (11/14)Example: Copper rod impact*SOLID SECTION, ELSET=ROD, M
19、ATERIAL=COPPER, CONTROL=ORTHOGONAL*SECTION CONTROLS, KINEMATIC SPLIT=ORTHOGONAL, BINED, NAME=ORTHOGONALundeformed shapedeformed shapeC3D8R formulationShortening (mm)Widening (mm)Relative CPU timedefault (average strain)-13.085.4731orthogonal kinematics-13.095.4720.63Simplifying the Model (12/14)Cent
20、roid kinematic formulation for hexahedral solid elementsThe centroid formulation is the most economical. The centroid formulation is based on the centroidal strain operator and simple hourglass base vectors. Using the simple hourglass base vectors instead of the orthogonal hourglass shape vectors re
21、duces hourglass mode computations by a factor of three.The hourglass base vectors are not orthogonal to rigid body rotation for general element configurations. Therefore, hourglass strain may be generated with large rigid body rotations with this formulation. This formulation is mended only for prob
22、lems with little rigid body rotation and reasonable mesh refinement. Simplifying the Model (13/14)Example: Copper rod impact (contd)*SOLID SECTION, ELSET=ROD, MATERIAL=COPPER, CONTROL=ORTHOGONAL*SECTION CONTROLS, KINEMATIC SPLIT=CENTROID, BINED, NAME=ORTHOGONALdeformed shapeundeformed shapeC3D8R for
23、mulationShortening (mm)Widening (mm)Relative CPU timedefault (average strain)-13.085.4731orthogonal kinematics-13.095.4720.63centroidal kinematics-13.075.2310.49Simplifying the Model (14/14)Small-strain shell elementsSmall-strain shell elements were discussed in Lecture 2, Elements.Small-strain shel
24、l elements are more computationally efficient than their large-strain counterparts.They are appropriate for analyses involving small membrane strains and arbitrarily large rotationsundeformed shapedeformed shapePipe whip simulation analyzed with various shell element typesElement typeRelative CPU ti
25、meS4R1.0S4RSW0.78S4RS0.66Parallel Execution (1/13)Parallel execution: reduces run time for analyses that require a large number of increments and/or contain a large number of nodes and elementsproduces analysis results that are independent of the number of processors used provided the number of doma
26、ins is the same for each job 45,785 elements 370,086 DOF 3 ms simulation 87,369 incrementsFor the latest performance data visit Support Certified hardware SIMULIA System InformationParallel Execution (2/13)Activating parallel processing with the default options: Parallelization in Abaqus/Explicit is
27、 available for:shared memory architecture platforms using a thread-based loop level or thread-based domain position implementationboth shared and distributed memory architecture platforms using an MPI-based domain position parallel implementationabaqus job=phone cpus=4Parallel Execution (3/13)Shared
28、 memorySeparate processors share data on the same memory spacei.e., these are multiple processors of a single machine Data shared via pointers (thread-based) or passed via an interface (MPI)Distributed memory Processors have independent memory space i.e., these are multiple processors on separate ma
29、chines (clusters). Data passed via an interface (MPI)shared memory architectureCPU 1CPU 2Memorydistributed memory architecture (clusters)CPU 1CPU 2NetworkMemoryMemoryParallel Execution (4/13)shared memory architectureCPU 1CPU 2MemoryCommunication between processorsThe thread-based mode:data are shar
30、ed between processors via pointersavailable only on shared memory architecture platformsis available on all supported shared memory systemsThread - basedParallel Execution (5/13)shared memory architectureCPU 1CPU 2Memorydistributed memory architecture (clusters)NetworkCPU 1MemoryCPU 2MemoryCommunica
31、tion between processorsThe MPI (Message Passing Interface) mode: is available for both shared and distributed memory architecture platformsThe MPI facilitates messaging between processorsrequires that the MPI components be installedMPIMPIParallel Execution (6/13)Parallelization in Abaqus/Explicit is
32、 implemented in two ways: domain level loop levelDomain-level position methodWith this method Abaqus automatically breaks the model up into topological domains and assigns each domain to a processor. Domains are distributed evenly among the available processors. The analysis is then carried out inde
33、pendently in each domain.However, information must be passed between the domains in each increment because the domains share common boundaries. 8 domains1Parallel Execution (7/13)Domain-level position method (contd)The domain-level method is the default.It is mended for most large analyses.The domai
34、n-level speedup factor may be significantly more than what can be achieved with loop-level parallelization (discussed later). Both MPI- and thread-based communication schemes are supported with the domain-level method.The MPI mode is required for distributed memory systems (clusters).During the anal
35、ysis, separate selected results (.sel) and state (.abq) files are created for each processor.At the end of the analysis the individual files are merged automatically.There are some restrictions when using the domain-level method. For example, general contact is mended over contact pairs Refer to the
36、 user guides for more information.18 domainsParallel Execution (8/13)Domain-level position method (contd)Execution procedure (at the command line):abaqus job=phone cpus=8 parallel=domain domains=8 mp_mode=mpiDefaults can be modified using abaqus_v6.env.RequiredDefaults 18 domainsParallel Execution (
37、9/13)Domain-level position method (contd)Execution procedure (at the command line):abaqus job=phone cpus=4 parallel=domain domains= mp_mode=mpiRequired18 dynamic_load_balancing The number of domains must be evenly divisible by the number of CPUsDynamic load balancing is available when the number of
38、domains is higher than the number of CPUsThe efficiency of the dynamic load balancing scheme depends onthe load imbalance inherent to the problem, on the degree of position, mendation: number of domains 4 * number CPUsand on the efficiency of the hardware8 domainsParallel Execution (10/13)Domain-lev
39、el position method (contd)Applications most likely to benefit from dynamic load balancing are problems with a strongly time-dependent and/or spatially varying computational load. 1Airbag deploymentCoupled Eulerian-Lagrangian (CEL) bird strike analysis Eulerian domaintarget panels (Lagrangian)birds (
40、Eulerian)Parallel Execution (11/13)Domain-level position method (contd)Abaqus automatically creates element and node sets for each domain.8 domains1Parallel Execution (12/13)Loop-level methodThe loop-level method parallelizes low-level loops that are responsible for most of the computational cost. T
41、he element, node, and contact pair operations account for the majority of the low-level parallelized routines. Only the thread-based communication scheme is supported with the loop-level method.Distributed memory systems (clusters) are not supported.There are no additional restrictions when running
42、with this method;however, the speedup factor may be significantly less than what can be achieved with domain-level parallelization.The speedup factor will vary depending on the features included in the analysis.Execution procedure (at the command line):abaqus job =job-name cpus=n parallel=loop2Paral
43、lel Execution (13/13)Parallelization options summary for Abaqus/Explicit:MPI-based Loop-levelparallelizationDomain-levelparallelizationShared memory architectureDistributed memory architecture (clusters)Thread-basedThread-basedMPI-based Domain-levelparallelization= Default (when available) Technique
44、s for Reducing CPU Time (1/13)Features that can be used to reduce analysis run time include:Initial conditionsStatic initialization and importMonitoring for extreme values of output variablesMonitoring for steady stateMass scalingSelective subcyclingIncreasing load ratesTechniques for Reducing CPU T
45、ime (2/13)Initial conditions can be used to reduce the length of some analyses.You can define initial conditions (stress fields, hardening conditions, velocities,) rather than modeling the history that lead to the initial conditions.Example: Drop test of a cordless mouseApply initial velocity condit
46、ions consistent with a 1m free fall.V3Mouse initial velocity*INITIAL CONDITIONS, TYPE=VELOCITY ALL_NODES, 3, 4500Techniques for Reducing CPU Time (3/13)Static initialization and importAbaqus provides a capability to transfer the deformed mesh and associated material state between an Abaqus/Standard
47、analysis and an Abaqus/Explicit analysis (and vice-versa).Reduce the analysis run time by solving portions of an analysis with Abaqus/Standard.Examples:Simulate a transient event in Abaqus/Explicit after steady-state conditions have been simulated in Abaqus/StandardSimulate springback in Abaqus/Stan
48、dard after forming analysis in Abaqus/Explicit This functionality is discussed in Lecture 12, Importing and Transferring Results.steady-state spinning solution in Abaqus/Standarddynamic blade release analysis in Abaqus/ExplicitTechniques for Reducing CPU Time (4/13)Monitoring for extreme values of o
49、utput variablesMonitoring the values of critical variables and halting the analysis when those variables exceed a given criterion can increase modeling efficiency.For example, in a hydroforming simulation the time needed to model the completion of the physical process may be unknown. It may depend o
50、n the magnitude of the displacement of a node or a group of nodes in the model.Terminating the analysis when that displacement is reached may save many CPU hours.fluid pressurepunch loadTechniques for Reducing CPU Time (5/13)Use output filters to record bounding output values over time and halt an a
51、nalysis when a critical value has been reachedThe bounding value operators areMaximum valueMinimum valueAbsolute maximum valueThese can be applied to field output and history outputThey can be applied to digitally filtered or unfiltered data For more information on digital filters see Lecture 11 Out
52、put FilteringFor tensor and vector field output requests, Abaqus applies the bounding value operations to each component independently. Techniques for Reducing CPU Time (6/13)Example: Hydroforming of a boxStop the analysis before the MONITOR node pulls out from between the holder and the draw cap. D
53、efine the filter*FILTER, NAME=MIN-53, OPERATOR=MIN, LIMIT=-53, HALTExploded view of initial configuration draw capblankBlank holderpunchhydroforming pressure loadpunch load13253 mm in the negative 2-directionMONITOR nodeUse the Operator type to find the minimum without digitally filtering the result
54、1Techniques for Reducing CPU Time (7/13)Example: Hydroforming of a boxStop the analysis before the MONITOR node pulls out from between the holder and the draw cap. Define the output request*Output, history, filter=Min-53*Node Output, nset=MONITOR U2,2Techniques for Reducing CPU Time (8/13)Example (c
55、ontd): Hydroforming of a boxThe end of the status (.sta) file:Final deformed shape of the blankMONITOR (node 2)Techniques for Reducing CPU Time (9/13)Monitoring for steady stateSteady-state detection can be used to terminate an Abaqus/Explicit analysis when specified steady-state criteria are met.Th
56、is feature is available for quasi-static unidirectional processes.For example rolling, wire drawing, and extrusion processes The following steady-state detection norms are available for the specification of steady-state detection criteria:Normalized cross-section plastic strain at a specified cuttin
57、g planeLargest of the area moments of inertia of the plane cross sectionAverage force magnitude at a rigid body reference nodeAverage torque magnitude at a rigid body reference nodeTerminating the analysis when steady state is reached may save many CPU hours.Techniques for Reducing CPU Time (10/13)E
58、xample: Steady-state flat rolling simulationSteady-state detection is used to terminate a flat rolling simulation.A complete discussion of this example is included in Appendix 3 Advanced Adaptive Meshing.Request sampling at uniform intervals for an Eulerian adaptive meshing analysis. Cutting plane A
59、nalysis terminates when the steady state is detected at the cutting plane.outflowinflowsteady-state criteria definitionOnly when all of the criteria specified have been satisfied will the analysis be considered to have reached steady state. Techniques for Reducing CPU Time (11/13)Mass scaling speeds
60、 up an analysis by increasing the stable time increment size.Artificially increasing the material density by a factor of f 2 increases the stable time increment by a factor of f.Mass scaling for quasi-static analyses was discussed in Lecture 4.Mass scaling for high-speed dynamic events was discussed
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2026年云南省大理白族自治州单招职业倾向性测试题库含答案详解
- 2026年上海立信会计金融学院单招职业适应性考试题库含答案详解(a卷)
- 2026年上海大学单招职业倾向性考试题库及完整答案详解一套
- 2026年云南省临沧地区单招职业倾向性测试题库附参考答案详解(典型题)
- 2026年亳州职业技术学院单招综合素质考试题库附参考答案详解(预热题)
- 2026年云南省大理白族自治州单招职业倾向性考试题库含答案详解(考试直接用)
- 2026年中山火炬职业技术学院单招职业倾向性考试题库附答案详解(综合卷)
- 2026年云南省楚雄彝族自治州单招职业适应性测试题库及参考答案详解1套
- 2026年上海戏剧学院单招职业倾向性考试题库附答案详解(研优卷)
- 2026年上海立达学院单招职业技能测试题库附参考答案详解(典型题)
- 生产工单管理办法
- 豪迈创新管理办法
- 脑出血恢复期的护理课件
- 数字营销课件模块三数字化消费者行为
- 景观照明设施养护服务方案投标文件(技术方案)
- 儿童中医养生保健方案
- 苏教牛津译林版小学英语六年级上册单词背诵默写本
- 高中英语词汇3500词(必背)-excel版
- 四川省高职院校单独招生文化考试(中职类)数学模拟试题
- 老年康复患者住院病案书写要点专家共识
- GB/T 21499-2024粮油检验稻谷和糙米潜在出米率测定方法
评论
0/150
提交评论