NPACI Rocks Tutorial
NPACI All Hands Meeting, March 18, 2003
Copyright 2003 UC Regents

Schedule
- Rocks 101 (45 mins)
  - Introduction to Rocks
  - Cluster architecture (HW & SW)
  - Rocks software overview
- Advanced Rocks (45 mins)
  - Modifying the default configuration
  - Cluster monitoring (UCB's Ganglia)
- Hands-on labs (90 mins)
  - Break out into groups and build an x86 cluster
  - Run an MPI job
  - Customization (packages, monitoring)
  - Compare to IA64 cluster

Rocks 101

Make Clusters Easy
- Enable application scientists to build and manage their own resources
  - Hardware cost is not the problem
  - System administrators cost money, and do not scale
  - Software can replace much of the day-to-day grind of system administration
- Train the next generation of users on loosely coupled parallel machines
  - Current price-performance leader for HPC
  - Users will be ready to "step up" to NPACI (or other) resources when needed
- Rocks scales to Top500-sized resources
  - Experiment on small clusters
  - Build your own supercomputer with the same software!

Past
- Rocks 1.0 released at SC2000
  - Good start on automating cluster building
  - Included early prototype (written in Perl) of Ganglia
  - Result of collaboration between SDSC and UCB's Millennium group
- Rocks 2.x
  - Fully automated installation (frontend and compute nodes)
  - Programmable software configuration
  - SCS (Singapore) first external group to contribute software patches

Present
- Rocks 2.3.2 (today's lab is part of our beta testing)
  - First simultaneous x86 and IA64 release
- Proven scalability
  - #233 on current Top500 list
  - 287-node production cluster at Stanford
  - You can also build small clusters
- Impact
  - Rocks clusters on 6 continents
    - No Antarctica yet.
  - 4 large NSF ITRs using Rocks as core software infrastructure

Rocks Registration Page (5 days old)
- /rocks-register

Rocks in the Commercial World
- Rocks cluster vendors
  - Cray
  - Dell
  - Promicro Systems
    - See page 79 of April's Linux Journal
  - SCS (in Singapore)
    - Contributed PVFS, SGE to Rocks
    - Active on the Rocks mailing list
- Training and support
  - Intel is training customers on Rocks
  - Callident is offering support services

Promicro Systems

Alternative Cluster Toolkits
- Cplant
  - DOE clustering software
- OpenMosix
  - Single system image
- OpenSCE
  - Kasetsart University, Thailand
  - Compatible with Rocks
- OSCAR
  - Aka cluster-in-a-box, MSC.Linux
  - "Best-practices of clustering"
  - Packaging effort, not a cluster distribution
- Scyld
  - Bproc-based system (SSI process space)
  - Commercial
- SCore
  - Oldest clustering effort
  - Touches on all aspects of clustering software

What Sets Us Apart
- Fully automated cluster deployment
  1. Get and burn the ISO CD (DVD for IA64) image
  2. Boot frontend with CD/DVD
  3. Fill out 7 configuration screens (mostly Red Hat)
  4. Reboot frontend machine
  5. Integrate compute nodes with insert-ethers
  6. Ready to go!
- Complete out-of-the-box solution with rational default settings
- Identical environment for x86 or IA64

Testimonials and Complaints
From the Rocks-discuss mail list and other sources

New User on Rocks Mail List
"I managed to install Rocks with five nodes. The nodes have a small HD, 2.5 GB each, the cluster is in my home on a private network behind a Linux box firewall. And it looks like everything is working fine. I can see all the nodes and the front-end in the Ganglia web interface. I built it so I can learn more about clusters. And to tell the truth I have no idea on what to do with it, I mean where to start, how to use it, what to use it for."

Power Users
- Response to previous poster
  - "It's sometimes scary how easy it is to install Rocks."
  - This coined the phrase "Rocks, scary technology"
- Comment from a cluster architect
  - "You guys are the masters of the bleedin' obvious."
  - This one flattered us

Another New User
"I've set up a Rocks cluster thanks to the bone simple installation. Thanks for making it so easy. The drawback, because it was so easy, I didn't learn much about clustering."

Independent Survey of Clustering Software
- http://heppc11.ft.uam.es/clusters/doc
- Compares Rocks, OSCAR and others
  - "NPACI Rocks is the easiest solution for the installation and management of a cluster under Linux."
  - "To install and manage a Linux cluster under OSCAR is more difficult than with Rocks."
  - "With OSCAR it is necessary to have some experience in Linux system administration, and some knowledge of cluster architecture"
- Our goal is to "make clusters easy"
  - Automate the system administration wherever possible
  - Enable non-cluster experts to build clusters

And Finally a Weekly Message
- "Your documentation sucks."
- Guilty, but improving
  - Rocks now installs users guide on every new cluster
  - Mailing list has several extremely helpful users

Hardware

Basic System Architecture

Why x86 Clusters?
Source: Dave Pierce, SIO

Minimum Components
- x86 / IA64 server
- Local hard drive
- Power
- Ethernet

Optional Components
- Myrinet high-performance network
- Network-addressable power distribution unit
- Keyboard/video/mouse network not required
  - Non-commodity
  - How do you manage your management network?
  - Crash carts have a lower TCO

Software

Cluster Software Stack

Common to Any Cluster

Red Hat
- Stock Red Hat 7.3 w/ updates (AW 2.1 for IA64)
- Linux 2.4 kernel
- No support for other distributions
  - Red Hat is the market leader for Linux in the US and becoming so in Europe
- Excellent support for automated installation
  - Scriptable installation (Kickstart)
  - Very good hardware detection

Batch Systems
- Portable Batch System
  - MOM
    - Daemon on every node
    - Used for job launching and health reporting
  - Server
    - On the frontend only
    - Queue definition, and aggregation of node information
  - Scheduler
    - Policies for what job to run out of which queue at what time
    - Maui is the common HPC scheduler
- SGE - Sun Grid Engine
  - Alternative to PBS
  - Integrated into Rocks by SCS (Singapore)
- Schedulers manage scarce resources
  - Clusters are cheap
  - You might not want a scheduler

Communication Layer
- None
  - "Embarrassingly parallel"
- Sockets
  - Client-server model
  - Point-to-point communication
- MPI - Message Passing Interface
  - Message passing
  - Static model of participants
- PVM - Parallel Virtual Machines
  - Message passing
  - For heterogeneous architectures
  - Resource control and fault tolerance

Sockets Are Low Level
- Sockets
  - Point-to-point
  - N machines = (n^2 - n)/2 connections
  - 1, 3, 6, 10, 15, ...
- MPI/PVM
  - Shared virtual channel
  - Implementation could be sockets
  - Easier to program

Rocks Cluster Software

Cluster State Management
- Static information
  - Node addresses
  - Node types
  - Site-specific configuration
- Dynamic information
  - CPU utilization
  - Disk utilization
  - Which nodes are online

Cluster Database

Ganglia
- Scalable cluster monitoring system
  - Based on IP multicast
  - Matt Massie, et al. from UCB
- Gmon daemon on every node
  - Multicasts system state
  - Listens to other daemons
  - All data is represented in XML
- Ganglia command line
  - Python code to parse XML to English
- Gmetric
  - Extends Ganglia
  - Command line to multicast single metrics

Ganglia Screenshot

Cluster Software Management
- Software packages
  - RPMs
    - Standard Red Hat (desktop) packaged software
    - Or your own add-ons
  - Rocks-dist
    - Manages the RPM repository
    - This is the distribution
- Software configuration
  - Tuning RPMs
    - For clusters
    - For your site
    - Other customization
  - XML Kickstart
    - Programmatic system building
    - Scalable

Building a Rocks Distribution
- Start with Red Hat
- Add updates, Rocks (and optional other) software
- Add Kickstart profiles
- Modify Red Hat installation boot image
- Resulting in a Red Hat compatible Rocks distribution

Kickstart
- Red Hat's Kickstart
  - Monolithic flat ASCII file
  - No macro language
  - Requires forking based on site information and node type
- Rocks XML Kickstart
  - Decompose a kickstart file into nodes and a graph
    - Graph specifies OO framework
    - Each node specifies a service and its configuration
  - Macros and SQL for site configuration
  - Driven from web CGI script

Sample Node File

    Enable SSH
    EOF
    Host *
        ForwardX11 yes
        ForwardAgent yes
    EOF
    chmod o+rx /root
    mkdir /root/.ssh
    chmod o+rx /root/.ssh

Sample Graph File
- Default graph for NPACI Rocks.

Kickstart Framework

Appliances
- Laptop / Desktop
  - Appliances
  - Final classes
  - Node types
- Desktop IsA
  - standalone
- Laptop IsA
  - standalone
  - pcmcia
- Code re-use is good

Optional Drivers
- PVFS
  - Parallel Virtual File System
  - Kernel module built for all nodes
  - Initial support (full support in future version of Rocks)
  - User must decide to enable
- Myrinet
  - High speed and low latency interconnect
  - GM/MPI for user applications
  - Kernel module built for all nodes with Myrinet cards

Your Cluster Software

Let's Build a Cluster

Hardware Setup
- Grab two nodes
- Use the cross-over cable to connect eth0 on the frontend to eth0 on the compute node
- Hook up a monitor and keyboard

Software Installation
- Put the CD in the frontend
- At the "boot:" prompt, type: frontend
- Start configuration

Cluster Information

Partition Your Disks

Configure Private Network

Configure Public Network

Set Your Hostname

Set the Root Password

Configure NIS

And Away We Go

Advanced Rocks

Add Compute Node with Insert-ethers
- Collect the Ethernet MAC address of cluster nodes
- Only done once, during integration
- Populates cluster database

Discovered Appliance

Remotely Starting Reinstallation on Two Nodes

Monitor the Install with eKV

Ganglia Gmetrics
- How to design new metrics for Ganglia to track and graph.
- Called user-defined metrics, or gmetrics.
  1. Write gathering function in Python.
  2. Push to all compute nodes.
  3. Allow gschedule to call gatherer repeatedly.
  4. Watch the graphs grow.

Write Metric Gatherer in Python
- Writing a gmetric gatherer is easy in Python: Rocks has native support.
- Gschedule is a daemon running by default on all Rocks nodes.
  - Will call our gatherer function at regular intervals, with some randomization.
- We just need to provide a name and a value for the metric.

packets.py

    #
    # SDSC All Hands Meeting 2003 - Advanced Rocks tutorial.
    # A gmetric.
    #
    from gmon.gmetric import Metric


    class packets(Metric):
        """Publishes the number of multicast packets received on eth0."""

        dev = "eth0"

        def __init__(self):
            # Publish value every 2 seconds on average.
            Metric.__init__(self, 2)

        def name(self):
            return "mcast-packets-in-%s" % (self.dev)

        def value(self):
            dev = open("/proc/net/dev")
            for line in dev.readlines():
                if not line.count(self.dev):
                    continue
                # Discard the "dev:" prefix, keep the counters.
                fields = line.split(":")[1]
                values = fields.split()
            dev.close()
            # Field 7 of the receive counters is the multicast count.
            return int(values[7])

Push New Gmetric to All Nodes
- Test on individual nodes
  - Log in (ssh)
  - Copy Python source to /opt/ganglia/lib/python/gmon/metrics/
- Packaging
  - Use RPM for permanent installation
- Compute nodes

Gschedule Daemon
- The Rocks gschedule daemon will call our packets metric repeatedly.

    # gschedule --debug

  or

    # service gschedule restart

  - Will publish our metric every 2 seconds.
  - Ganglia's gmetad will detect new metric, and graph it.
- Gschedule, time

Visualization
- Our new gmetric will be graphed by gmetad.
  - Visible in web frontend's host view.
  - Only metrics which are numeric and volatile (slope not zero) are graphed.
- Gmetric restrictions:
  - Total metric size

    ... /tmp/ks.cfg

- Look for contrib-new in the %packages section of /tmp/ks.cfg

Reinstall Compute Nodes
- Use shoot-node to reinstall compute nodes in order to apply the contrib-new package

    # shoot-node compute-0-0

Building a Custom Kernel RPM
- Pick a compute node

    # ssh compute-0-0

  - Develop on this node; this doesn't "trash" the environment on the frontend
- Create a custom .config file.
  - This can be done from scratch or based on a canned Red Hat configuration file

    # cd /usr/src/linux-2.4
    # cp configs/kernel-2.4.18-i686-smp.config .config
    # vi .config

Building a Custom Kernel RPM
- Build a kernel RPM

    # make rpm

- Copy the resulting kernel RPM to the frontend:

    # scp /usr/src/redhat/RPMS/i686/kernel.rpm frontend-0:/home/install/contrib/RPMS/public/i386

- Rebuild the distro on the frontend

    # cd /home/install
    # rocks-dist dist

Compute Node Partitioning
- Creates a 4 GB root partition on the first drive
  - This partition is volatile, that is, when the node is reinstalled, this partition is reformatted
- Remainder of first drive is put into a partition called "/state/partition1"
- For each remaining drive, one partition is created per drive and named "/state/partition2", "/state/partition3", etc.
- All partitions labeled "/state/partitionN" are not reformatted on reboots.

Example

    Drive          Drive size   Device      Mount point          Size
    Root drive     18 GB        /dev/sda1   /                    4 GB
                                /dev/sda2   /state/partition1    14 GB
    Second drive   36 GB        /dev/sdb1   /state/partition2    36 GB
    Third drive    18 GB        /dev/sdc1   /state/partition3    18 GB
    Fourth drive   72 GB        /dev/sdd1   /state/partition4    72 GB
    Fifth drive    36 GB        /dev/sde1   /state/partition5    36 GB
    Sixth drive    181 GB       /dev/sdf1   /state/partition6    181 GB

User-specified Partitioning
- Override the default partitioning
  - Configuration file:

    # cd /home/install/profiles/2.3.2/site-nodes
    # cp skeleton.xml replace-auto-partition.xml

- Add your partitions to replace-auto-partition.xml

    / --size 5000 --ondisk sda
    swap --size 512 --ondisk sda
    /mydata --size 1 --grow --ondisk sda

Building Your Own CD Set
- After customizing your cluster distribution, make ISO images to export it
- First, you must mirror the full Rocks release

    # cd /home/install
    # rocks-dist mirror

  or, put in the second (then third) Rocks CD and execute:

    # rocks-dist copycd

Building Your Own CD Set
- Build a CD set

    # cd /home/install
    # rm -rf cdrom
    # rocks-dist --dist=cdrom cdrom

- This creates a CD set under the directory /home/install/cdrom

    # cd /home/install/cdrom/7.3/en/os

Lab

Thanks
- Inspired by a lab conducted at TACC

Lab
- Building 2-node x86 clusters
- Ganglia grid
- Adding users
- Running Li
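
The default compute node partitioning rules described earlier in this tutorial (a 4 GB root partition on the first drive, the remainder of that drive as /state/partition1, and one /state/partitionN per additional drive) can be sketched in a few lines of Python. This is only an illustration of the naming and sizing rules, not Rocks code: the function name `default_partitions`, its arguments, and the whole-gigabyte sizes are assumptions made for the example.

```python
def default_partitions(drives, root_gb=4):
    """Return (device, mount point, size in GB) tuples for a compute node.

    `drives` is an ordered list of (device, size_gb) pairs, for example
    [("/dev/sda", 18), ("/dev/sdb", 36)]. Hypothetical helper, not part of Rocks.
    """
    layout = []
    for index, (device, size_gb) in enumerate(drives):
        if index == 0:
            # First drive: volatile root partition, rest goes to partition1.
            layout.append((device + "1", "/", root_gb))
            layout.append((device + "2", "/state/partition1", size_gb - root_gb))
        else:
            # Every other drive becomes a single /state/partitionN.
            mount = "/state/partition%d" % (index + 1)
            layout.append((device + "1", mount, size_gb))
    return layout


if __name__ == "__main__":
    for row in default_partitions([("/dev/sda", 18), ("/dev/sdb", 36)]):
        print("%-10s %-18s %3d GB" % row)
```

Feeding it the six drives from the partitioning example reproduces the same device names, mount points and sizes shown there.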