您好,欢迎访问三七文档
当前位置:首页 > 商业/管理/HR > 管理学资料 > ism(implicit-shape-mode)
1ComputerVision:ImplicitShapeModelDr.EdgarSeemannseemann@pedestrian-detection.comComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann2LocalFeaturesComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann3SofarPartsweredefinedmanuallyPartsrepresentedthesemanticstructurei.e.face,legetc.Questions:Dothesepartsdecomposethevariabilityinanoptimalway?MustthepartshaveasemanticmeaningShouldweusesmaller/largerparts?Canwefindpartsautomatically?ComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann4RequirementsforpartdecompositionRepeatablei.e.weshouldbeabletofindthepartdespitearticulationorimagetransformations(e.g.rotation,perspective,lighting)DistinctivePartshouldnotbeconfoundedwithotherpartsTheregionsshouldcontainan“interesting”structureCompactTypicallynolengthyorstrangelyshapedpartsEfficientItshouldbecomputationallyinexpensivetodetectorrepresentpartCoverpartsneedtosufficientlycovertheobjectComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann5ApproachNpixelsNpixelsSimilaritymeasureAfe.g.colorBfe.g.colorA1A2A3TffdBA),(1.Findasetofdistinctivekey-points3.Extractandnormalizetheregioncontent2.Definearegionaroundeachkeypoint4.Computealocaldescriptorfromthenormalizedregion5.MatchlocaldescriptorsComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann6HessianDetector[Beaudet78]HessiandeterminantIxxIyyIxy2))(det(xyyyxxIIIIHessian2)^(.xyyyxxIIIInMatlab:yyxyxyxxIIIIIHessian)(ComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann7AutomaticScaleSelectionFunctionresponsesforincreasingscale(scalesignature))),((1xIfmii)),((1xIfmiiComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann8Results:Laplacian-of-GaussianComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann9ImplicitShapeModelComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann1010K.Grauman,B.LeibeImplicitShapeModel(ISM)Basicideas1.AutomaticallylearnalargenumberoflocalpartsthatoccurontheobjectAlsoreferredtoasvisualvocabularyorappearancecodebook2.Learnastar-topologystructuralmodelFeaturesareconsideredindependentgivenobj.centerx1x3x4x6x5x2ComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann11VisualVocabulary/AppearanceCodebookComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann12VisualVocabularyDetectkeypointsonalltrainingexamplesExtractfeaturedescriptionsaroundkeypointsResult:AlargesetoflocalimagedescriptorsoccurringonpeopleComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann13VisualVocabularyGroupvisuallysimilarlocaldescriptorsi.e.partsthatarereoccurringParts,thatoccuronlyoncearediscarded(theycouldresultfromnoiseorunusualstructures)ComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann14SideNote:GroupingAlgorithmsPartitionalClusteringK-MeansGaussianMixtureClustering(EM)HierarchicalorAgglomerativeClusteringSingle-LinkGroupAverageWard’smethod(minimumvariance)ComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann16ComplexityStandardApproach:Timecomplexity:O(n2logn)ComputedistancematrixConsecutivelymergethetwomostsimilarclustersSpacecomplexity:O(n2)RNNAlgorithm[deRham’80,Benzecri’82]Timecomplexity:O(n2)Spacecomplexity:O(n)Requirement:“reducibilityproperty”[Bruynooghe’77]ComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann17SpaceComplexityNote,thatspacecomplexityisquiteimportantforclusteringlargedatasetsExample:100000datapointsStandarddistancematrixcontains:105*105/2=1010/2entries-~20GBifoneentryhas32bit-DoesyourPChaveenoughRAM?ComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann18AgglomerativeclusteringproducesahierarchyDifficultquestion:wheretostop?Ideally,clustersshouldbevisuallycompact.ButDistancevaluedependsonfeaturedimensionality.Appropriateratio#features/#clustersdependsondatasetandinterestpointdetector.Needstobeselectedforeachdetector/descriptorcombination!ClusteringHierarchyComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann19VisualVocabularyVocabularysize~10000clustersProbabilisticvotesdecide,whetherpartisimportantornotComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann20LearningSpatialStructure:“Star”-ModelComputerVisionforHuman-ComputerInteractionResearchGroup,UniversitätKarlsruhe(TH)cv:hciDr.EdgarSeemann211.LearnappearancecodebookExtractlocal
本文标题:ism(implicit-shape-mode)
链接地址:https://www.777doc.com/doc-1861907 .html