2D Human Pose Estimation New Benchmark and State o

2DHumanPoseEstimation:NewBenchmarkandStateoftheArtAnalysisMykhayloAndriluka1,3,LeonidPishchulin1,PeterGehler2,andBerntSchiele11MaxPlanckInstituteforInformatics,Germany2MaxPlanckInstituteforIntelligentSystems,Germany3StanfordUniversity,USAAbstractHumanposeestimationhasmadesigniﬁcantprogressduringthelastyears.Howevercurrentdatasetsarelimitedintheircoverageoftheoverallposeestimationchallenges.Stilltheseserveasthecommonsourcestoevaluate,trainandcomparedifferentmodelson.Inthispaperweintro-duceanovelbenchmark“MPIIHumanPose”1thatmakesasigniﬁcantadvanceintermsofdiversityanddifﬁculty,acontributionthatwefeelisrequiredforfuturedevelop-mentsinhumanbodymodels.Thiscomprehensivedatasetwascollectedusinganestablishedtaxonomyofover800humanactivities[1].Thecollectedimagescoverawidervarietyofhumanactivitiesthanpreviousdatasetsincludingvariousrecreational,occupationalandhouseholdingactiv-ities,andcapturepeoplefromawiderrangeofviewpoints.Weprovidearichsetoflabelsincludingpositionsofbodyjoints,full3Dtorsoandheadorientation,occlusionlabelsforjointsandbodyparts,andactivitylabels.Foreachim-ageweprovideadjacentvideoframestofacilitatetheuseofmotioninformation.Giventheserichannotationsweper-formadetailedanalysisofleadinghumanposeestimationapproachesandgaininginsightsforthesuccessandfail-uresofthesemethods.1.IntroductionRecentposeestimationmethodsemploycomplexap-pearancemodels[2,9,15]andrelyonlearningalgorithmstoestimatemodelparametersfromthetrainingdata.Theperformanceoftheseapproachescruciallydependsontheavailabilityoftheannotatedtrainingimagesthatarerep-resentativefortheappearanceofpeopleclothing,strongarticulation,partial(self-)occlusionsandtruncationatim-ageborders.Althoughthereexiststrainingsetsforspecialscenariossuchassportscenes[12,13]anduprightpeople[17,2],thesebenchmarksarestilllimitedintheirscopeandvariabilityofrepresentedactivities.Sportscenedatasets1Availableathuman-pose.mpi-inf.mpg.de.typicallyincludehighlyarticulatedposes,butarelimitedwithrespecttovariabilityofappearancesincepeoplearetypicallywearingtightsportsoutﬁts.Inturn,datasetssuchas“FashionPose”[2]and“Armlets”[9]aimtocollectim-agesofpeoplewearingavarietyofdifferentclothingtypes,andincludeocclusionsandtruncationbutaredominatedbypeopleinsimpleuprightstandingposes.Tothebestofourknowledgenoattempthasbeenmadetoestablishamorerepresentativebenchmarkaimingtocoverawidepalletofchallengesforhumanposeestima-tion.Webelievethatthishindersfurtherdevelopmentonthistopicandproposeanewbenchmark“MPIIHumanPose”.Ourbenchmarksigniﬁcantlyadvancesstateoftheartintermsofappearancevariabilityandcomplexity,andincludesmorethan40,000imagesofpeople.WeusedYouTubeasadatasourceandcollectedimagesandimagesequencesusingqueriesbasedonthedescriptionsofmorethan800activities.Thisresultsinadiversesetofimagescoveringnotonlydifferentactivities,butindoorandout-doorscenes,avarietyofimagingconditions,aswellasbothamateurandprofessionalrecordings(c.f.Fig.1).Thisal-lowsustostudyexistingbodyposeestimationtechniquesandidentifytheirindividualfailuremodes.RelatedworkThecommonlyusedpubliclyavailabledatasetsforevaluationof2DhumanposeestimationaresummarizedinTab.1accordingtotheyearofthecor-respondingpublication.Bothfullbodyandupperbodydatasetsareincluded.Existingbenchmarkscoveraspectsofthehumanposeestimationtasksuchassportscenes[12,21],frontal-facingpeople[8,3,17],peopleinteractingwithobjects[23],poseestimationingroupphotos[5]andposeestimationofpeo-pleperformingsynchronizedactivities[4].Earlierdatasetssuchas“Parse”[16]and“Buffy”[8]arestillcommonlyfoundinevaluations[22,15].Howeverthesmalltrainingsetsincludedinthesedatasetsmakethemun-suitablefortrainingmodelswithcomplexappearancerepre-sentationsandmultiplecomponents[13,17,2],whichhavebeenshowntoperformbest.1bicyclingconditioningexercisedancingﬁshingandhuntingbicycling,BMXskimachineballroomﬁsh.fromriverbankhomeactivitieshomerepairinactivityquietlawnandgardentanninghidescarpentrysittingquietlydrivingtractormiscellaneousmusicplayingoccupationreligiousactivitiesstandingviolin,sittinghorsegroomingsit.,playinginstrum.runningselfcaresportstransportationrunning,stairs,uptakingmedicationsoccerridinginabusvolunteeractivitieswalkingwateractivitieswinteractivitiesplayingwithchildrenbirdwatchingsnorkelingskating,icedancingFigure1.Randomlychosenimagesfromeachof20activitycat-egoriesoftheproposed“MPIIHumanPose”dataset.Imagecap-tionsindicateactivitycategory(1strow)andactivity(2ndrow).Toviewthefulldatasetvisithuman-pose.mpi-inf.mpg.de.Someeffortshavebeenmadetocollectlargersetsofimages.Forexample[13]extendstheLSPdatasetto10;000imagesofpeopleperforminggymnastics,athleticsandparkour.[2]proposesalarge“FashionPose”datasetcollectedfromfashionblogs.Thisdatasetaimstocoverawidevarietyinpeopleclothing.TheLSPandFashion-Posedatasetsarecomplementaryandfocusontwodifferentchallengesforhumanposeestimation:posevariabilityandvariabilityofpeopleappearance.Howeversincetheyarecollectedwithaspeciﬁcfocusinmind,thesedatasetsdonotcoverreal-lifechallengessuchastruncation,occlusionsbysceneobjectsandvariabilityofimagingc

2D Human Pose Estimation New Benchmark and State o

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

电子产品工艺与实训——说课

滴灌施工组织设计

(一)公司法定中文名称四川汇源光通信股份有限公司

酒泉市肃州区屯升小学标准化学校创建评估自评报告

民营泰安协和医院竞争战略研究

西藏发展：XXXX年半年度报告摘要

李宁人力资源咨询项目建议书

融资融券对证券市场的影响

英语国际音标表(48个)word版

水浒传108位好汉的名字和绰号

相关文档

相关搜索