Characterization of the convergence of stationary

arXiv:0802.3235v2[cs.NE]25Feb2008CharacterizationoftheconvergenceofstationaryFokker–PlancklearningArturoBerronesPosgradoenIngenier´ıadeSistemasFacultaddeIngenier´ıaMec´anicayEl´ectricaUniversidadAut´onomadeNuevoLe´onAP126,Cd.Universitaria,SanNicol´asdelosGarza,NL66450,M´exicoarturo@yalma.ﬁme.uanl.mxAbstractTheconvergencepropertiesofthestationaryFokker-Planckalgorithmfortheesti-mationoftheasymptoticdensityofstochasticsearchprocessesisstudied.Theoret-icalandempiricalargumentsforthecharacterizationofconvergenceoftheestima-tioninthecaseofseparableandnonseparablenonlinearoptimizationproblemsaregiven.SomeimplicationsoftheconvergenceofstationaryFokker-Plancklearningfortheinferenceofparametersinartiﬁcialneuralnetworkmodelsareoutlined.Keywords:heuristics,optimization,stochasticsearch,statisticalmechanics1IntroductionTheoptimizationofacostfunctionwhichhasanumberoflocalminimaisarelevantsubjectinallﬁeldsofscienceandengineering.Inparticular,mostofmachinelearningproblemsarestatedlikeoftenlycomplex,optimizationtasks[1].Acommonsetupconsistinthedeﬁnitionofappropriatefamiliesofmodelsthatshouldbeselectedfromdata.Theselectionstepinvolvestheoptimizationofacertaincostorlikelihoodfunction,whichisusuallydeﬁnedonahighdimensionalparameterspace.Inotherapproachestolearning,likeBayesianinference[15,14],theentirelandscapegeneratedbytheoptimizationproblemassociatedwithasetofmodelstogetherwiththedataandthecostfunctionisrelevant.Otherareasinwhichglobaloptimizationplaysaprominentroleincludeoperationsresearch[12],optimaldesigninengineeneredsystems[18]andmanyotherimportantapplications.PreprintsubmittedtoElsevier26February2008Stochasticstrategiesforoptimizationareessentialtomanyoftheheuristictechniquesusedtodealwithcomplex,unstructuredglobaloptimizationprob-lems.Methodslikesimulatedannealing[13,19,9,24]andevolutionarypopula-tionbasedalgorithms[10,7,21,11,24],haveproventobevaluabletools,capa-bleofgivegoodqualitysolutionsatarelativelysmallcomputationaleﬀort.Inpopulationbasedoptimization,searchspaceisexploredthroughtheevolu-tionofﬁnitepopulationsofpoints.Thepopulationalternatesperiodsofself–adaptation,inwhichparticularregionsofthesearchspaceareexploredinanintensivemanner,andperiodsofdiversiﬁcationinwhichsolutionsincorporatethegainedinformationaboutthegloballandscape.Thereisalargeamountofevidencethatindicatesthatsomeexponentsofpopulationbasedalgorithmsareamongthemosteﬃcientglobaloptimizationtechniquesintermsofcom-putationalcostandreliability.Thesemethods,howeverarepurelyheuristicandconvergencetoglobaloptimaisnotguaranteed.Simulatedannealingontheotherhand,isamethodthatstatisticallyassuresglobaloptimality,butinalimitthatisverydiﬃculttoacomplishinpractice.Insimulatedanneal-ingasingleparticleexploresthesolutionspacethroughadiﬀusiveprocess.Inordertoguaranteeglobaloptimality,the“temperature”thatcharacterizethediﬀusionshouldbeloweredaccordingtoalogarithmicschedule[8].Thisconditionimplyverylongcomputationtimes.Inthiscontributiontheconvergencepropertiesofanestimationprocedureforthestationarydensityofageneralclassofstochasticsearchprocesses,re-centlyintroducedbytheauthor[2],isexplored.Bytheestimationprocedure,promisingregionsofthesearchspacecanbedeﬁnedonaprobabilisticba-sis.Thisinformationcanthenbeusedinconnectionwithalocallyadaptivestochasticordeterministicalgorithm.Preliminaryapplicationsofthisdensityestimationmethodintheimprovementofnonlinearoptimizationalgorithmscanbefoundin[22].Theoreticalaspectsonthefoundationsofthemethod,itslinkstostatisticalmechanicsandpossibleuseofthedensityestimationpro-cedureasageneraldiversiﬁcationmechanismarediscussedin[3].Inthenextsectionwegiveabriefaccountofthebasicelementsofourstationarydensityestimationalgorithm.Thereafter,theoreticalandempiricalevidenceontheconvergenceofthedensityestimationisgiven.Besidesglobaloptimization,thedensityestimationapproachmayprovideanoveltechniqueformaximumlikelihoodestimationandBayesianinference.Thispossibility,inthecontextofartiﬁcialneuralnetworktraining,isoutlinedinSection4.FinalconclusionsandremarksarepresentedinSection5.22Fokker–PlancklearningofthestationaryprobabilitydensityofastochasticsearchWenowproceedwithabriefaccountofthestationarydensityestimationprocedureonwhichthepresentworkisbased.ConsidertheminimizationofacostfunctionoftheformV(x1,x2,...,xn,...,xN)withasearchspacedeﬁnedoverL1,n≤xn≤L2,n.Astochasticsearchprocessforthisproblemismodeledby˙xn=−∂V∂xn+ε(t),(1)whereε(t)isanadditivenoisewithzeromean.Equation(1),knownasLangevinequationinthestatisticalphysicsliterature[16,25],capturestheessentialpropertiesofageneralstochasticsearch.Inparticular,thegradienttermgivesamechanismforlocaladaptation,whilethenoisetermprovidesabasicdiversi-ﬁcationstrategy.Equation(1)canbeinterpretedasanoverdampednonlineardynamicalsystemcomposedbyNinteractingparticlesinthepresenceofad-ditivewhitenoise.Thestationarydensityestimationisbasedonananalogywiththisphysicalsystem,consideringreﬂectingboundaryconditions.Itfol-lowsthatthestationaryconditionaldensityforparticlensatisfythelineardiﬀerentialequation,D∂p(xn|{xj6=n=x

Characterization of the convergence of stationary

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

伊芝伴内衣加盟招商手册

双连拱隧道施工技术

煤的气化

第六章磁性矿物

食品药品监管加强和创新社会管理的几点思考

110KV送出线路工程质量、安全生产管理制度

西直门区域价值研究

第一时间看透对方

工作态度与心态

电大劳动法网上作业03任务

相关文档

相关搜索