An empirical study of learning speed in back-propa

AnEmpiricalStudyofLearningSpeedinBack-PropagationNetworksScottE.FahlmanSeptember1988CMU-CS-88-162AbstractMostconnectionistorneuralnetworklearningsystemsusesomeformoftheback-propagationalgorithm.However,back-propagationlearningistooslowformanyapplications,anditscalesuppoorlyastasksbecomelargerandmorecomplex.Thefactorsgoverninglearningspeedarepoorlyunderstood.Ihavebegunasystematic,empiricalstudyoflearningspeedinbackprop-likealgorithms,measuredagainstavarietyofbenchmarkproblems.Thegoalistwofold:todevelopfasterlearningalgorithmsandtocontributetothedevelopmentofamethodologythatwillbeofvalueinfuturestudiesofthiskind.Thispaperisaprogressreportdescribingtheresultsobtainedduringthefirstsixmonthsofthisstudy.TodateIhavelookedonlyatalimitedsetofbenchmarkproblems,buttheresultsontheseareencouraging:Ihavedevelopedanewlearningalgorithmthatisfasterthanstandardbackpropbyanorderofmagnitudeormoreandthatappearstoscaleupverywellastheproblemsizeincreases.ThisresearchwassponsoredinpartbytheNationalScienceFoundationunderContractNumberEET-8716324andbytheDefenseAdvancedResearchProjectsAgency(DOD),ARPAOrderNo.4976underContractF33615-87-C-1499andmonitoredbytheAvionicsLaboratory,AirForceWrightAeronauticalLaboratories,AeronauticalSystemsDivision(AFSC),Wright-PattersonAFB,OH45433-6543.Theviewsandconclusionscontainedinthisdocumentarethoseoftheauthorsandshouldnotbeinterpretedasrepresentingtheofficialpolicies,eitherexpressedorimplied,oftheseagenciesoroftheU.S.Government.11.IntroductionNote:InthispaperIwillnotattempttoreviewthebasicideasofconnectionismorback-propagationlearning.See[3]forabriefoverviewofthisareaand[10],chapters1-8,foradetailedtreatment.WhenIrefertostandardback-propagationinthispaper,Imeantheback-propagationalgorithmwithmomentum,asdescribedin[9].Thegreatestsingleobstacletothewidespreaduseofconnectionistlearningnetworksinreal-worldapplicationsistheslowspeedatwhichthecurrentalgorithmslearn.Atpresent,thefastestlearningalgorithmformostpurposesisthealgorithmthatisgenerallyknownasback-propagationorbackprop[6,7,9,18].Theback-propagationlearningalgorithmrunsfasterthanearlierlearningmethods,butitisstillmuchslowerthanwewouldlike.Evenonrelativelysimpleproblems,standardback-propagationoftenrequiresthecompletesetoftrainingexamplestobepresentedhundredsorthousandsoftimes.Thismeansthatwearelimitedtoinvestigatingrathersmallnetworkswithonlyafewthousandtrainableweights.Someproblemsofreal-worldimportancecanbetackledusingnetworksofthissize,butmostofthetasksforwhichconnectionisttechnologymightbeappropriatearemuchtoolargeandcomplextobehandledbyourcurrentlearning-networktechnology.OnesolutionistorunournetworksimulationsonfastercomputersortoimplementthenetworkelementsdirectlyinVLSIchips.Anumberofgroupsareworkingonfasterimplementations,includingagroupatCMUthatisusingthe10-processorWarpmachine[13].Thisworkisimportant,butevenifwehadanetworkimplementeddirectlyinhardwareourslowlearningalgorithmswouldstilllimittherangeofproblemswecouldattack.Advancesinlearningalgorithmsandinimplementationtechnologyarecomplementary.Ifwecancombinehardwarethatrunsseveralordersofmagnitudefasterandlearningalgorithmsthatscaleupwelltoverylargenetworks,wewillbeinapositiontotackleamuchlargeruniverseofpossibleapplications.SinceJanuaryof1988Ihavebeenconductinganempiricalstudyoflearningspeedinsimulatednetworks.Ihavestudiedthestandardbackpropalgorithmandanumberofvariationsonstandardback-propagation,applyingthesetoasetofmoderate-sizedbenchmarkproblems.ManyofthevariationsthatIhaveinvestigatedwerefirstproposedbyotherresearchers,butuntilnowtherehavebeennosystematicstudiestocomparethesemethods,individuallyandinvariouscombinations,againstastandardsetoflearningproblems.Onlythroughsuchsystematicstudiescanwehopetounderstandwhichmethodsworkbestinwhichsituations.Thispaperisareportontheresultsobtainedinthefirstsixmonthsofthisstudy.Perhapsthemostimportantresultistheidentificationofanewlearningmethod--actuallyacombinationofseveralideas--thatonarangeofencoder/decoderproblemsisfasterthanstandardback-propagationbyanorderofmagnitudeormore.Thisnewmethodalsoappearstoscaleupmuchbetterthanstandardbackpropasthesizeandcomplexityofthelearningtaskgrows.Imustemphasizethatthisisaprogressreport.Thelearning-speedstudyisfarfromcomplete.UntilnowIhaveconcentratedmostofmyeffortonasingleclassofbenchmarks,namelytheencoder/decoderproblems.Likeanyfamilyofbenchmarkstakeninisolation,encoder/decoderproblemshavecertainpeculiaritiesthatmaybiastheresultsofthestudy.Untilamorecomprehensivesetofbenchmarkshasbeenrun,itwouldbeprematuretodrawanysweepingconclusionsormakeanystrongclaimsaboutthewidespreadapplicabilityofthesetechniques.22.Methodology2.1.WhatMakesaGoodBenchmark?Atpresentthereisnowidelyacceptedmethodologyformeasuringandcomparingthespeedofvariousconnectionistlearningalgorithms.Someresearchershaveproposednewalgorithmsbasedonlyonatheoreticalanalysisoftheproblem.Itissometimeshardtodeterminehowwellthesetheoreticalmodelsfitactualpractice.Otherresearchersimplementtheirideasandrunoneortwobenchmarkstodemonstratethespeedoftheresultings

An empirical study of learning speed in back-propa

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

X年造价师工程《工程造价计价与控制》模拟3

合同无效后处理之原则

怎样利用媒介策略来倍增广告效果

企业管理员工激励考核表

第五章财政收入和支出的核算

始发直达列车的组织与铁路运输精益生产

上岗考试试题(主设备维护部分)daan

仁爱八年级英语单词

卫星数据和产品共享服务指南

一年级下册分类与整理ppt课件[1]

相关文档

相关搜索