Evaluating Capture-Recapture Models with Two Inspe

V13–17/05/991EvaluatingCapture-RecaptureModelswithTwoInspectorsKhaledElEmamOliverLaitenbergerNationalResearchCouncil,CanadaInstituteforInformationTechnologyBuildingM-50,MontrealRoadOttawa,OntarioCanadaK1AOR6Khaled.El-Emam@iit.nrc.caFraunhoferInstituteforExperimentalSoftwareEngineeringSauerwiesen6D-67661KaiserslauternGermany+49(0)6301707251laiten@iese.fhg.deAbstractCapture-recapture(CR)modelshavebeenproposedasanobjectivemethodforcontrollingsoftwareinspections.CRmodelswereoriginallydevelopedtoestimatethesizeofanimalpopulations.Theyhavealsobeenusedtoestimatethenumberofdefectsinaninspectedartifact.Armedwiththisestimate,onecandecidewhethertheartifactrequiresareinspectiontoensurethataminimalinspectioneffectivenesslevelhasbeenattained.LittleevaluativeresearchhasbeenperformedthusfarontheutilityofCRmodelsforinspectionswithtwoinspectors.Furthermore,thesestudieshavefocusedontherelativeerrorofthedefectcontentestimatesexclusively.InthispaperwereportonanextensiveMonteCarlosimulationthatevaluatedsixcapture-recapturemodelsfortwoinspectorsassumingacodeinspectionscontext.Inadditiontorelativeerror,weevaluatetheaccuracyofthereinspectiondecision.Thelatterismorecongruentwiththemannerinwhichthesemodelswouldbeusedinpractice.Ourresultsindicatethatthemostappropriatecapture-recapturemodelfortwoinspectorsisanestimatororiginallydevelopedbyChapmanthatallowsforinspectorswithdifferentcapabilities.Thiswillhavearelativelyhighdecisionaccuracyandwillperformbetterthanthedefaultdecisionofnoreinspections.Furthermore,weidentifytheconditionsunderwhichthisestimatorwillperformbest.1IntroductionArecentliteraturereviewfoundthat,onaverage,softwareinspectionsfindonly57%ofdefectsincodeanddesigndocuments[8].Giventhesubstantialdefectdetectioncostsavingsthatcanbeaccruedbyincreasingtheeffectiveness1ofinspections[8],contemporaryresearchhasfocusedonimprovedreadingtechniques(e.g.,see[33][3][19][41])andonreinspections(e.g.,see[24])formaximizingeffectiveness.Thefocusofthispaperisonmaximizinginspectioneffectivenessthroughreinspections.Reinspectionscanbeconsideredpartofthegeneralproblemofwhentostopinspections.Asisthecasewithtesting,oneneedsacriterionbywhichtodecidewhetheradocumentshouldbeinspectedanew,orwhetheritcanpasstothesubsequentphase.Mostorganizationshavenotinstitutionalizedproceduresfordecidingwhentostopsoftwareinspections.Thosethatdohaveutilized,forexample,historicalnormssothatiftoomanydefectsarefoundcomparedtothenormthenthisistakenasevidenceofapoordocument,whiletoofewaretakenasevidenceofapoorinspection[24].However,thisapproachassumesthatvariationsamongreviewsarelargerthanvariationsamongdocuments.Ifthisisnotthecasethenthiscanleadtoreinspectionsofhighqualitydocuments,andlowqualitydocumentsmayeasilypass.Toaddressthesepotentialproblems,onecanuseCapture-Recapture(CR)models.CRmodelswereinitiallydevelopedtoestimatethesizeofanimalpopulations(e.g.,see[38][51]).Inasoftwareengineeringcontext,theyhavebeenappliedincontrollingthetestingprocess[4][30][36][21][37],andmorerecentlytheyhavebeenusedincontrollingtheinspectionprocess[23][24].1Effectivenessisdefinedastheproportionofdefectsinadocumentthatwerefoundduringtheinspection.V13–17/05/992Whenappliedtosoftwareinspections,CRmodelscanbeusedtoestimatethenumberofdefectsintheinspecteddocument.Usingthisestimateandtheknownnumberofdefectsfound,thenumberofremainingdefectsintheinspecteddocumentcanbeestimated.Subsequently,armedwiththisinformation,theinspectionteamcanmakethedecisionastowhetherthedocumentshouldbereinspectedtoreduceitsdefectcontentbeforepassingitontothenextphaseofthelifecycle.ResearchersatBellLabsfirstappliedCRmodelsforrequirementsanddesigninspections[23][24][25].However,inthesestudiesthetruenumberofdefectswasunknownandthereforeanevaluationoftheirtrueefficacywasnotpossible.LaterworkconsistedofaMonteCarlosimulationtoevaluatetherobustnessofdifferentCRestimatorstoviolationsoftheirassumptions[50].ObjectiveempiricalevaluationofCRmodelsstartedwiththestudyofWohlinetal.[53].However,thisstudywasconductedwithnon-softwareengineeringdocuments.Subsequentworkusedsoftwareengineeringartifacts[10][12][35][44].Alloftheaboveworkutilizedmodelsthatwereoriginallydevelopedinwildliferesearch.OtherresearchersconsideredtheincorporationofBayesianmethodstoestimatedefectcontent[5],performedfurtherevaluationsofassumptionviolationswhenusingCRestimates[48],andevaluatedtheapplicabilityofCRmodelstoperspective-basedreading[12][49].Analternativeapproachwasproposedin[54],theDetectionProfileMethod(DPM).TheDPMisanintuitivelyappealingapproachthatcanbeeasilyexplainedgraphicallytononspecialists.AlaterstudysuggestedamethodforselectingbetweenaCRmodelandtheDPM[9],andthiswassubsequentlyfurtherevaluatedin[39].InadditiontotheexperiencesreportedbytheresearchersatBellLabs,theuseoftheDPMataninsurancecompanyinGermanywasreportedin[11],andtheapplicationofCRmodelsintelecommunicationsprojects[1].Therefore,thereisagrowingadoptionofdefectcontentestimationmodelsinindustrialpractice,andspecificallyCRmodels.LittleempiricalinvestigationoftheutilityofCRmodelsforinspectionswithtwoin

Evaluating Capture-Recapture Models with Two Inspe

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

千万元工程的陨落——国企ERP实施亲历记

第十一章品牌和包装

《汽车电器与电子设备》课件(广科大玉洁)第三章起动

主要施工机械表

无功补偿装置安装单位工程凉水井

12612某工程BT总承包合同最终版

建设工程相关法律法规

朱燕杰医药行业市场分析

医院舆论危机应对培训

考研时间安排及方法经验

相关文档

相关搜索