1 An Efficient Procedure for Building the Function

1AnEfficientProcedureforBuildingtheFunctionalPerformanceModelofaProcessorAlexeyLastovetsky,RaviReddy,RobertHigginsDepartmentofComputerScience,UniversityCollegeDublin,BelfieldDublin4,IrelandE-mail:Alexey.Lastovetsky@ucd.ie,Manumachu.Reddy@ucd.ie,Robert.Higgins@ucd.ieAbstract---Inthispaper,wepresentanefficientprocedureforbuildingapiecewiselinearfunctionapproximationofthespeedfunctionofaprocessorwithhierarchicalmemorystructure.Theproceduretriestominimizetheexperimentaltimeusedforbuildingthespeedfunctionapproximation.WedemonstratetheefficiencyofourprocedurebyperformingexperimentswithamatrixmultiplicationapplicationandaCholeskyFactorizationapplicationthatusememoryhierarchyefficientlyandamatrixmultiplicationapplicationthatusesmemoryhierarchyinefficientlyonalocalnetworkofheterogeneouscomputers.1.IntroductionInourpreviousresearch[1],weaddressedtheproblemofoptimaldistributionorschedulingofcomputationaltasksonnetworksofheterogeneouscomputerswhenoneormoretasksdonotfitintothemainmemoryoftheprocessorsandwhenrelativespeedsofprocessorscannotbeaccuratelyapproximatedbyconstantfunctionsoftheproblemsize.Wedesignedefficientalgorithmstosolvethisschedulingproblemusingaperformancemodelthatintegratessomeoftheessentialfeaturesofaheterogeneousnetworkofcomputers(HNOC)havingamajorimpactontheperformance,suchastheprocessorheterogeneity,theheterogeneityofmemorystructure,andtheeffectsofpaging.Underthismodel,thespeedofeachprocessorisrepresentedbyacontinuousandrelativelysmoothfunctionoftheproblemsize.Thismodelisapplication-centricinthesensethatgenerallyspeakingdifferentapplicationswillcharacterizethespeedoftheprocessorbydifferentfunctions.Actuallyongeneral-purposecommonheterogeneousnetworks,2sizeoftheproblemAbsolutespeed)(1xs)(2xsFigure1.Usingpiecewiselinearapproximationtobuildspeedbandsfor2processors.Thecircularpointsareexperimentallyobtainedwhereasthesquarepointsarecalculatedusingheuristics.Thespeedbandforprocessors1(x)isbuiltfrom3experimentallyobtainedpoints(applicationrunonthisprocessorusesmemoryhierarchyinefficiently)whereasthespeedbands2(x)(applicationrunonthisprocessorusesmemoryhierarchyefficiently)isbuiltfrom4experimentallyobtainedpoints.anintegratedcomputerwillexperienceconstantandstochasticfluctuationsintheworkload.Thischangingtransientloadwillcauseafluctuationinthespeedofthecomputerinthesensethattheexecutiontimeofthesametaskofthesamesizewillvaryfordifferentrunsatdifferenttimes.Thenaturalwaytorepresenttheinherentfluctuationsinthespeedistouseaspeedbandratherthanaspeedfunction.Thewidthofthebandcharacterizestheleveloffluctuationintheperformanceduetochangesinloadovertime.Inourpreviousresearch,wedidnotproposeanymethodstobuildandmaintainthespeedbandofaprocessor.Inthispaper,wepresentanefficientandapracticalprocedureforbuildingapiecewiselinearfunctionapproximationofthespeedbandofaprocessorwithhierarchicalmemorystructure.Thisbandshouldbeabletorepresentanyspeedfunctionoftheprocessor,thatis,anyspeedfunctionrepresentingtheperformanceoftheprocessorshouldfitintothespeedband.Theproceduretriestominimizetheexperimentaltimeusedforbuildingthepiecewiselinearfunction3approximationofthespeedband.Wedonotproposemethodstomaintainthespeedfunctionapproximation.Thisisasubjectofourfutureresearch.Samplepiece-wiselinearfunctionapproximationsofthespeedbandofaprocessorareshowninFigure1.Eachoftheapproximationsisbuiltusingasetoffewexperimentallyobtainedpoints.Themorepointsusedtobuildtheapproximation,themoreaccuratetheapproximationis.Howeveritisprohibitivelyexpensivetouselargenumberofpoints.Henceanoptimalsetoffewpointsneedstobechosentobuildanefficientpiecewiselinearfunctionapproximationofthespeedband.Suchanapproximationgivesthespeedoftheprocessorforanyproblemsizewithcertainaccuracywithintheinherentdeviationoftheperformanceofcomputerstypicallyobservedinthenetwork.Therestofthepaperisorganizedasfollows.Insection2,weformulatetheproblemofbuildingapiecewiselinearfunctionapproximationofaprocessorandpresentanefficientandapracticalproceduretosolvetheproblem.Todemonstratetheefficiencyofourprocedure,weperformexperimentsusingamatrixmultiplicationapplicationandaCholeskyFactorizationapplicationthatusememoryhierarchyefficientlyandamatrixmultiplicationapplicationthatusesmemoryhierarchyinefficientlyonalocalnetworkofheterogeneouscomputers.2.ProcedureforBuildingSpeedFunctionApproximationThissectionisorganizedasfollows.Westartwiththeformulationofthespeedbandapproximationbuildingproblem.Thisisfollowedbyasectiononobtainingtheloadfunctionscharacterizingtheleveloffluctuationinloadovertime.Thethirdsectionpresentstheassumptionsadoptedbyourprocedure.Wethenpresentsomeoperationsandrelationsrelatedtothepiecewiselinearfunctionapproximationofthespeedband.Andfinallyweexplainourproceduretobuildthepiecewiselinearfunctionapproximation.4SizeoftheproblemAbsolutespeedSizeoftheproblemAbsolutespeedreal-lifepiecewise(a)(b)SizeoftheproblemAbsolutespeedx))(,(maxxsx))(,(minxsxSizeoftheproblemAbsolutespeed1x2x3x))(,(1min1xsx))(,(2min2xsx))(,(3min3xsx))(,(1max1xsx))(,(2max2xsx))(,(3max3xsx(c)(d)Figure2.(a)Real-l

1 An Efficient Procedure for Building the Function

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

供应链管理模式的CEO(doc7)

微电子器件与IC设计 (1)

动力学研究物体的机械运动与作用力之间的关系

房建工程报验表格新

建筑工程质量通病89626793

酒店管理系统设计与实现

云复制平台产品使用说明书V13

电视节目策划

世联_河北唐山新天地项目开发公司诊断沟通小结_80页_XXXX年

新元制度之《企业文化大纲》

相关文档

相关搜索