Stereo Person Tracking with Adaptive Plan-View Statistical Templates

Michael Harville
Hewlett-Packard Laboratories, 1501 Page Mill Rd., ms 1181, Palo Alto, CA 94304, United States

Abstract

As the cost of computing per-pixel depth imagery from stereo cameras in real time has fallen rapidly in recent years, interest in using stereo vision for person tracking has greatly increased. Methods that attempt to track people directly in these "camera-view" depth images are confronted by their substantial amounts of noise and unreliable data. Some recent methods have therefore found it useful to first compute overhead, "plan-view" statistics of the depth data, and then track people in images of these statistics. We describe a new combination of plan-view statistics that better represents the shape of tracked objects and provides a more robust substrate for person detection and tracking than prior plan-view algorithms. We also introduce a new method of plan-view person tracking, using adaptive statistical templates and Kalman prediction. Adaptive templates provide more detailed models of tracked objects than prior choices such as Gaussians, and we illustrate that the typical problems with template-based tracking in camera-view images are easily avoided in a plan-view framework. We compare results of our method with those for techniques using different plan-view statistics or person models, and find our method to exhibit superior tracking through challenging phenomena such as complex inter-person occlusions and close interactions. Reasonable values for most system parameters may be derived from physically measurable quantities such as average person dimensions.

Keywords: person tracking, plan-view statistics, stereo depth images, adaptive template, Kalman filter

Email address: harville@hpl.hp.com (Michael Harville).

Preprint submitted to Elsevier Science, 20 August 2003

1 Introduction

Many methods for real-time multi-person detection and tracking with video cameras have been described in the literature. Unfortunately, few of these, if any, produce reliable results for long periods of time in unconstrained environments. This poor performance stems from the many difficult challenges that commonly beset the problem, among the most significant of which are:

- Segmenting the novel or dynamic objects ("foreground") in the video from the rest of the scene ("background")
- Distinguishing people from other foreground objects such as cars, shopping carts, or curtains blowing in the wind
- Avoiding distraction and confusion due to lighting-related scene appearance changes such as shadows, inter-reflections, and global illumination variation
- Tracking people through temporary occlusions, either in part or in full, by other people or by static objects in the scene
- Maintaining track integrity when people engage in close interactions, accelerate rapidly, or quickly change their body pose or appearance

Per-pixel depth or disparity imagery from stereo cameras offers much promise for dealing with these issues. For example, the distance information inherent in these images allows for straightforward assessment, in comparison with techniques based on monocular video, of the 3D locations of tracked objects. In addition, depth data

- Is a powerful cue for foreground segmentation
- Is relatively insensitive to lighting effects such as shadows and global illumination changes
- Provides shape and metric size information that can be used to distinguish people from other foreground objects
- Allows occlusions of people by each other or by background objects to be detected and handled more explicitly
- Permits the quick computation of new types of features for matching person descriptions across time
- Provides a third, disambiguating dimension of prediction in tracking

In recent years, as hardware and software for computing depth imagery from stereo cameras has become increasingly fast and cheap [2-4,1,5], several person detection and tracking methods that make use of real-time depth data have been presented. Most of these analyze and track features, gradients, and smoothly connected regions directly in the depth images themselves [6-9]. When the depth images are accompanied by a spatially- and temporally-registered color or grayscale video stream, the results of the depth-based analysis are easily integrated with those extracted from the color or luminance data. Many of the traditional frameworks for tracking in monocular views may then be applied, but to the much richer per-pixel feature space of appearance (color or luminance) plus shape (depth).

Fig. 1. Example of color-with-depth video input, obtained using the Point Grey Triclops camera [1]. In the depth image, brighter pixels indicate greater distance from the camera, and invalid (unreliable) depth data is shown in black.

This methodology is not as fruitful as one might hope, however, because today's stereo cameras produce depth images whose statistics are far less clean than those of standard color or monochrome video. For multi-camera stereo implementations, which compute depth by finding small area correspondences between image pairs, unreliable measurements often occur in image regions of little visual texture, as is often the case for walls, floors, or people wearing uniformly-colored clothing. This usually causes much of a depth image to be unusable. Also, it is not possible to find the correct correspondences in regions, usually near depth discontinuities in the scene, that are visible in one stereo input image but not the other. This results in additional regions of unreliable data, and causes the edges of an object in a depth image to be noisy and poorly aligned with the object's color image edges. All of these problems are evident in the typical color and depth image pair of Figure 1.

Even at scene locations where depth measurements are informative, the sensitivity of the stereo correspondence computation to very low levels of imager noise, lighting fluctuation, and scene motion leads to substantial depth nois