LEIBNIZ CENTER FOR RESEARCH IN COMPUTER SCIENCE TECHNICAL REPORT 2003-43

Computing Gaussian Mixture Models with EM using Equivalence Constraints

Noam Shental, Aharon Bar-Hillel, Tomer Hertz and Daphna Weinshall
School of Computer Science and Engineering and the Center for Neural Computation
The Hebrew University of Jerusalem, Jerusalem, Israel 91904
Email: {fenoam, aharonbh, tomboy, daphna}@cs.huji.ac.il

Abstract

Gaussian mixture models for density estimation are usually estimated in an unsupervised manner, using an Expectation Maximization (EM) procedure. In this paper we show how equivalence constraints can be incorporated into this procedure, leading to improved model estimation and improved clustering results. Equivalence constraints provide additional information on pairs of data points, indicating whether the points arise from the same source (positive constraint) or from different sources (negative constraint). Such constraints can be gathered automatically in some learning problems, and are a natural form of supervision in others. We present a closed form EM procedure for handling positive constraints, and a Generalized EM procedure using a Markov network for the incorporation of negative constraints. Using publicly available data sets, we demonstrate that incorporating equivalence constraints leads to considerable improvement in clustering performance, and that our algorithm outperforms all available competitors.

Keywords: semi-supervised learning, equivalence constraints, clustering, EM, Gaussian mixture models

1 Introduction

Gaussian Mixture Models (GMM) for density estimation are popular for two main reasons: they can be reliably computed by the efficient Expectation Maximization (EM) algorithm [1], and they provide a generative model for the way the data may have been created. The second property in particular accounts for their common use in unsupervised clustering, where typically the Gaussian components of the GMM are taken to represent different sources. This use is common because most other clustering algorithms are not generative, and therefore cannot provide predictions regarding previously unseen points.¹

When used for clustering in this way, the underlying assumption - i.e., that the density is comprised of a mixture of different Gaussian sources - is often hard to justify. It is therefore important to have additional information which can steer the GMM estimation in the "right" direction. For example, we may have access to the labels of part of the data set. The estimation problem then belongs to the semi-supervised learning domain, since the estimation relies on both labeled and unlabeled points.

In this paper we focus on another type of side-information, in which equivalence constraints between a few of the data points are provided. More specifically, we use an unlabeled data set augmented by equivalence constraints between pairs of data points, where the constraints determine whether each pair was generated by the same source or by different sources. We denote the former case as 'positive' constraints and the latter case as 'negative' constraints, and present a method to incorporate them into an EM procedure.
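Although the paper develops the procedure in detail later, the positive-constraint case can be sketched informally: if positive constraints are transitively closed into small groups of points ("chunklets") that must share a single hidden source, then a chunklet's E-step posterior is driven by the product of its points' likelihoods. The following is a minimal illustrative sketch of such an E-step, assuming points are sampled i.i.d. given the chunklet's source; it is not the paper's complete algorithm (the M-step and the handling of negative constraints are omitted).

```python
import numpy as np
from scipy.stats import multivariate_normal

def chunklet_e_step(chunklets, weights, means, covs):
    """Posterior over the M Gaussian sources for each chunklet.

    chunklets : list of (n_j, d) arrays; the points of chunklet j are
                constrained to come from the same (unknown) source.
    weights, means, covs : current mixture parameters for M sources.

    Because all points of a chunklet share one hidden label, the
    likelihood of chunklet j under source l is the product of its
    points' Gaussian densities, weighted by the mixing coefficient.
    """
    M = len(weights)
    log_post = np.empty((len(chunklets), M))
    for j, X in enumerate(chunklets):
        for l in range(M):
            log_post[j, l] = np.log(weights[l]) + \
                multivariate_normal.logpdf(X, means[l], covs[l]).sum()
    # Normalize in log space for numerical stability.
    log_post -= log_post.max(axis=1, keepdims=True)
    post = np.exp(log_post)
    return post / post.sum(axis=1, keepdims=True)
```

An unconstrained point is simply a chunklet of size one, for which this reduces to the standard EM E-step; larger chunklets concentrate their posterior mass on sources that can explain all of their points simultaneously.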
What do we expect to gain from the semi-supervised approach to GMM estimation? We may hope that introducing side-information into the EM algorithm will result in faster convergence to a solution of higher likelihood. But much more importantly, our equivalence constraints should change the GMM likelihood function. As a result, the estimation procedure may choose different solutions, which would have otherwise been rejected due to their low relative likelihood in the unconstrained GMM density model. Ideally the solution obtained with side information will be more faithful to the desired results. A simple example demonstrating this point is shown in Fig. 1.

[Figure 1 appears here: panels (a) and (b), each showing an unconstrained (left) and a constrained (right) clustering result.]

Figure 1: Illustrative examples to demonstrate the added value of equivalence constraints. (a) The data set consists of two vertically aligned classes: left - given no additional information, the EM algorithm identifies two horizontal classes, and this can be shown to be the maximum likelihood solution (with log likelihood of −3500 vs. log likelihood of −2800 for the solution shown on the right); right - additional side information in the form of equivalence constraints changes the probability function and we get a vertical partition as the most likely solution. (b) The data set consists of two classes with partial overlap: left - without constraints the most likely solution includes two non-overlapping sources; right - with constraints the correct model with overlapping classes was retrieved as the most likely solution. In all plots only the class assignment of novel unconstrained points is shown.

Why do we use equivalence constraints, rather than partial labels as in prior work (summarized below)? Our basic observation is that unlike labels, in many unsupervised learning tasks equivalence constraints may be extracted with minimal effort or even automatically. One example is when the data is inherently sequential and can be modelled by a Markovian process. Consider for example a security camera application, where the objective is to find all the frames in which the same intruder appears. Due to the continuous nature of the data, intruders extracted from successive frames in the same clip can be assumed to come from the same person, thus forming positive constraints. In addition, two intruders which appear simultaneously in front of two cameras cannot be the same person, hence a negative constraint is automatically established. Another analogous example is speaker segmentation and recognition, in which the conversation between several speakers needs to be segmented and clustered according to speaker identity. Here, it may likewise be possible to gather equivalence constraints automatically, by exploiting the temporal continuity of the speech signal.
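To make the surveillance example concrete, here is a purely illustrative sketch of automatic constraint extraction. The detection records and their fields ('id', 'clip', 'frame', 'camera') are hypothetical names chosen for this example, not data structures from the paper; the two rules implement the observations above: temporal continuity within a clip yields positive pairs, and simultaneous appearance on different cameras yields negative pairs.

```python
from itertools import combinations

def constraints_from_detections(detections):
    """Derive equivalence constraints from surveillance detections.

    `detections` is a hypothetical list of dicts with keys 'id',
    'clip', 'frame' and 'camera'. We assume frame indices are
    synchronized timestamps across cameras, and at most one
    detection per frame within a clip.
    """
    positives, negatives = [], []

    # Positive constraints: detections in successive frames of the
    # same clip are assumed to show the same person.
    by_clip = {}
    for d in detections:
        by_clip.setdefault(d['clip'], []).append(d)
    for clip_dets in by_clip.values():
        clip_dets.sort(key=lambda d: d['frame'])
        for a, b in zip(clip_dets, clip_dets[1:]):
            if b['frame'] - a['frame'] == 1:
                positives.append((a['id'], b['id']))

    # Negative constraints: detections at the same time on different
    # cameras cannot be the same person.
    by_frame = {}
    for d in detections:
        by_frame.setdefault(d['frame'], []).append(d)
    for frame_dets in by_frame.values():
        for a, b in combinations(frame_dets, 2):
            if a['camera'] != b['camera']:
                negatives.append((a['id'], b['id']))

    return positives, negatives
```

The resulting positive pairs could then be transitively closed into chunklets and fed to a constrained E-step such as the sketch above, while the negative pairs would enter the separate mechanism the paper proposes for negative constraints.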