Monte Carlo Hidden Markov Models

Sebastian Thrun and John Langford

December 1998
CMU-CS-98-179

School of Computer Science
Carnegie Mellon University
Pittsburgh, PA 15213

Abstract

We present a learning algorithm for hidden Markov models with continuous state and observation spaces. All necessary probability density functions are approximated using samples, along with density trees generated from such samples. A Monte Carlo version of Baum-Welch (EM) is employed to learn models from data, just as in regular HMM learning. Regularization during learning is obtained using an exponential shrinking technique. The shrinkage factor, which determines the effective capacity of the learning algorithm, is annealed down over multiple iterations of Baum-Welch, and early stopping is applied to select the right model. We prove that under mild assumptions, Monte Carlo Hidden Markov Models converge to a local maximum in likelihood space, just like conventional HMMs. In addition, we provide empirical results obtained in a gesture recognition domain, which illustrate the appropriateness of the approach in practice.

This research is sponsored in part by DARPA via AFMSC (contract number F04701-97-C-0022), TACOM (contract number DAAE07-98-C-L032), and Rome Labs (contract number F30602-98-2-0137). The views and conclusions contained in this document are those of the authors and should not be interpreted as necessarily representing official policies or endorsements, either expressed or implied, of DARPA, AFMSC, TACOM, Rome Labs, or the United States Government.

Keywords: annealing, any-time algorithms, Baum-Welch, density trees, early stopping, EM, hidden Markov models, machine learning, maximum likelihood estimation, Monte Carlo methods, temporal signal processing

1 Introduction

Over the last decade or so, hidden Markov models have enjoyed an enormous practical success in a large range of temporal signal processing domains. Hidden Markov models are often the method of choice in areas such as speech recognition [28, 27, 42], natural language processing [5], robotics [34, 23, 48], biological sequence analysis [17, 26, 40], and time series analysis [16, 55]. They are well-suited for modeling, filtering, classification and prediction of time sequences in a range of partially observable, stochastic environments.

With few exceptions, existing HMM algorithms assume that both the state space of the environment and its observation space are discrete. Some researchers have developed algorithms that support more compact feature-based state representations [15, 46], which are nevertheless discrete; others have successfully proposed HMM models that can cope with real-valued observation spaces [29, 19, 48]. Kalman filters [21, 56] can be thought of as HMMs with continuous state and action spaces, where both the state transition and the observation densities are linear-Gaussian functions. Kalman filters assume that the uncertainty in the state estimate is always normally distributed (and hence unimodal), which is too restrictive for many practical application domains (see e.g., [4, 18]).

In contrast, most "natural" state spaces and observation spaces are continuous. For example, the state space of the vocal tract of human beings, which plays a primary role in the generation of speech, is continuous; yet HMMs trained to model the speech-generating process are typically discrete. Robots, to name a second example, always operate in continuous environments; hence their states are usually best modeled by continuous state spaces. Many popular sensors (cameras, microphones, range finders) generate real-valued measurements, which are better modeled using continuous observation spaces. In practice, however, real-valued observation spaces are usually truncated into discrete ones to accommodate the limitations of conventional HMMs. A popular approach along these lines is to learn a code-book (vector quantizer), which clusters real-valued observations into finitely many bins, and thus maps real-valued sensor measurements into a discrete space of manageable size [54]. The discreteness of HMMs is in stark contrast to the continuous nature of many state and observation spaces.

Existing HMM algorithms possess a second deficiency, which is frequently addressed in the AI literature, but rarely in the literature on HMMs: they do not provide mechanisms for adapting their computational requirements to the available resources. This is unproblematic in domains where computation can be carried out off-line. However, trained HMMs are frequently employed in time-critical domains, where meeting deadlines is essential. Any-time algorithms [9, 58] address this issue. Any-time algorithms can generate an answer at any time; however, the quality of the solution increases with the time spent computing it. An any-time version of HMMs would enable them to adapt their computational needs to what is available, thus providing maximum flexibility and accuracy in time-critical domains. Marrying HMMs with any-time computation is therefore a desirable goal.

This paper presents Monte Carlo Hidden Markov Models (MCHMMs). MCHMMs employ continuous state and observation spaces, and once trained, they can be used in an any-time fashion. Our approach employs Monte Carlo methods for approximating a large, non-parametric class of density functions. To combine multiple densities (e.g., with Bayes rule), it transforms sample sets into density trees. Since continuous state spaces are sufficiently rich to overfit any data set, our approach uses shrinkage as a mechanism for regularization. The shrinkage factor, which determines the effective capacity of the HMM, is annealed down over multiple iterations of EM, and early stopping is applied to choose the right model.
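The code-book approach described above can be illustrated with a toy one-dimensional k-means quantizer. This is a minimal sketch of the general idea behind the vector quantizers cited in [54], not the specific method used in any of the referenced systems; the function names, data, and parameters below are purely illustrative:

```python
import random

def learn_codebook(observations, k, iters=20, seed=0):
    """Learn k codewords from 1-D real-valued observations
    via Lloyd's algorithm (plain k-means)."""
    rng = random.Random(seed)
    centers = rng.sample(observations, k)
    for _ in range(iters):
        # Assign each observation to its nearest codeword.
        bins = [[] for _ in range(k)]
        for x in observations:
            j = min(range(k), key=lambda i: (x - centers[i]) ** 2)
            bins[j].append(x)
        # Move each codeword to the mean of its bin.
        centers = [sum(b) / len(b) if b else centers[i]
                   for i, b in enumerate(bins)]
    return sorted(centers)

def quantize(x, centers):
    """Map a real-valued measurement to its discrete symbol:
    the index of the nearest codeword."""
    return min(range(len(centers)), key=lambda i: (x - centers[i]) ** 2)

# Two well-separated clusters of hypothetical sensor readings.
obs = [0.1, 0.2, 0.15, 5.0, 5.1, 4.9]
centers = learn_codebook(obs, k=2)
print(quantize(0.18, centers), quantize(5.05, centers))  # prints: 0 1
```

Once trained, the quantizer turns every real-valued measurement into one of k discrete symbols, which a conventional discrete-observation HMM can then consume; the information lost in this truncation is precisely what motivates the continuous observation spaces of MCHMMs.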