The Peregrine high-performance RPC system

ThePeregrineHigh-PerformanceRPCSystemDavidB.Johnson1WillyZwaenepoelDepartmentofComputerScienceRiceUniversityP.O.Box1892Houston,Texas77251-1892dbj@cs.cmu.edu,willy@cs.rice.eduAversionofthispaperappearedinSoftware—Practice&Experience,23(2):201–221,February1993.ThisworkwassupportedinpartbytheNationalScienceFoundationunderGrantsCDA-8619893andCCR-9116343,andbytheTexasAdvancedTechnologyProgramunderGrantNo.003604014.1Author’scurrentaddress:SchoolofComputerScience,CarnegieMellonUniversity,Pittsburgh,PA15213-3891.SummaryThePeregrineRPCsystemprovidesperformanceveryclosetotheoptimumallowedbythehardwarelimits,whilestillsupportingthecompleteRPCmodel.ImplementedonanEthernetnetworkofSun-3/60workstations,anullRPCbetweentwouser-levelthreadsexecutingonseparatemachinesrequires573microseconds.ThistimecompareswellwiththefastestnetworkRPCtimesreportedintheliterature,rangingfromabout1100to2600microseconds,andisonly309microsecondsabovethemeasuredhardwarelatencyfortransmittingthecallandresultpacketsinourenvironment.Forlargemulti-packetRPCcalls,thePeregrineuser-leveldatatransferratereaches8.9megabitspersecond,approachingtheEthernet’s10megabitpersecondnetworktransmissionrate.Betweentwouser-levelthreadsonthesamemachine,anullRPCrequires149microseconds.ThispaperidentiﬁessomeofthekeyperformanceoptimizationsusedinPeregrine,andquantitativelyassessestheirbeneﬁts.Keywords:Peregrine,remoteprocedurecall,interprocesscommunication,performance,distributedsystems,operatingsystems1.IntroductionThePeregrineremoteprocedurecall(RPC)systemisheavilyoptimizedforprovidinghigh-performanceinterprocesscommunication,whilestillsupportingthefullgeneralityandfunctionalityoftheRPCmodel[3,10],includingargumentsandresultvaluesofarbitrarydatatypes.ThesemanticsoftheRPCmodelprovidesampleopportunitiesforoptimizingtheperformanceofinterprocesscommunication,someofwhicharenotavailableinmessage-passingsystemsthatdonotuseRPC.ThispaperdescribeshowPeregrineexploitstheseandotheropportunitiesforperformanceimprovement,andpresentsPeregrine’simplementationandmeasuredperformance.WeconcentrateprimarilyonoptimizingtheperformanceofnetworkRPC,betweentwouser-levelthreadsexecutingonseparatemachines,butwealsosupportefﬁcientlocalRPC,betweentwouser-levelthreadsexecutingonthesamemachine.High-performancenetworkRPCisimportantforsharedserversandforparallelcomputationsexecutingonnetworksofworkstations.PeregrineprovidesRPCperformancethatisveryclosetothehardwarelatency.FornetworkRPCs,thehardwarelatencyisthesumofthenetworkpenalty[6]forsendingthecallandtheresultmessageoverthenetwork.Thenetworkpenaltyisthetimerequiredfortransmittingamessageofagivensizeoverthenetworkfromonemachinetoanother,andismeasuredwithoutoperatingsystemoverheadorinterruptlatency.Thenetworkpenaltyisgreaterthanthenetworktransmissiontimeforpacketsofthesamesizebecausethenetworkpenaltyincludesadditionalnetwork,device,andprocessorlatenciesinvolvedinsendingandreceivingpackets.LatencyforlocalRPCsisdeterminedbytheprocessorandmemoryarchitecture,andincludestheexpenseoftherequiredlocalprocedurecall,kerneltraphandling,andcontextswitchingoverhead[2].WehaveimplementedPeregrineonanetworkofSun-3/60workstations,connectedbya10megabitpersecondEthernet.Theseworkstationseachusea20-megahertzMotorolaMC68020processorandanAMDAm7990LANCEEthernetnetworkcontroller.TheimplementationusesanRPCpacketprotocolsimilartoCedarRPC[3],exceptthatablastprotocol[20]isusedformulti-packetmessages.TheRPCprotocolislayereddirectlyontopoftheIPInternetdatagramprotocol[13].Inthisimplementation,themeasuredlatencyforanullRPCwithnoargumentsorreturnvaluesbetweentwouser-levelthreadsexecutingonseparateSun-3/60workstationsontheEthernetis573microseconds.ThistimecompareswellwiththefastestnullnetworkRPCtimesreportedintheliterature,rangingfromabout1100to2600microseconds[3,12,8,15,17,19],andisonly309microsecondsabovethemeasuredhardwarelatencydeﬁnedbythenetworkpenaltyforthecallandresultpacketsinourenvironment.AnullRPCwithasingle1-kilobyteargumentrequires1397microseconds,showinganincreaseoverthetimefornullRPCwithnoargumentsofjustthenetworktransmissiontimefortheadditionalbytesofthecallpacket.Thistimeis338microsecondsabovethenetworkpenalty,andisequivalenttoauser-leveldatatransferrateof5.9megabitspersecond.Forlargemulti-packetRPCcalls,thenetworkuser-leveldatatransferratereaches8.9megabitspersecond,achieving89percentofthehardwarenetworkbandwidthand95percentofthemaximumachievabletransmissionbandwidthbasedonthenetworkpenalty.Betweentwouser-levelthreadsexecutingonthesamemachine,anullRPCwithnoargumentsorreturnvaluesrequires149microseconds.InSection2ofthispaper,wepresentanoverviewofthePeregrineRPCsystem.Section3discussessomeofthekeyperformanceoptimizationsusedinPeregrine.InSection4,wedescribethePeregrineimplementation,includingsingle-packetnetworkRPCs,multi-packetnetworkRPCs,andlocalRPCs.ThemeasuredperformanceofPeregrineRPCispresentedinSection5.InSection6,wequantifytheeffectivenessoftheoptimizationsmentionedinSection3.Section7comparesourworktootherRPCsystems,andSection8presentsourconclusions.12.Overview

The Peregrine high-performance RPC system

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

CRM实验室课程导入

临时围板、围墙及安全护棚施工方案(新)

第一章财产保险基础

第八章金融犯罪

欧盟水产品卫生法规及要求

haccp培训教程2

商务策划立场与原则

农副产品定购合同(1)

污水处理厂运行报表管理制度

品牌升级战略管理杉杉集团公司

相关文档

相关搜索