Deep Forest: Towards An Alternative to Deep Neural Networks

Zhi-Hua Zhou and Ji Feng
National Key Laboratory for Novel Software Technology
Nanjing University, Nanjing 210023, China
{zhouzh, fengj}@lamda.nju.edu.cn

Abstract

In this paper, we propose gcForest, a decision tree ensemble approach with performance highly competitive to deep neural networks. In contrast to deep neural networks, which require great effort in hyper-parameter tuning, gcForest is much easier to train. In fact, even when gcForest is applied to different data from different domains, excellent performance can be achieved with almost the same hyper-parameter settings. The training process of gcForest is efficient and scalable. In our experiments, its training time running on a PC is comparable to that of deep neural networks running with GPU facilities, and the efficiency advantage may be even more apparent because gcForest is naturally apt to parallel implementation. Furthermore, in contrast to deep neural networks, which require large-scale training data, gcForest can work well even when only small-scale training data are available. Moreover, as a tree-based approach, gcForest should be easier for theoretical analysis than deep neural networks.

1 Introduction

In recent years, deep neural networks have achieved great success in various applications, particularly in tasks involving visual and speech information [Krizhevsky et al., 2012; Hinton et al., 2012], leading to the hot wave of deep learning [Goodfellow et al., 2016].

Though deep neural networks are powerful, they have apparent deficiencies. First, it is well known that a huge amount of training data is usually required, which prevents deep neural networks from being applied to tasks with small-scale data. Note that even in the big data era, many real tasks still lack a sufficient amount of labeled data due to the high cost of labeling, leading to inferior performance of deep neural networks on those tasks. Second, deep neural networks are very complicated models, and powerful computational facilities are usually required for the training process, making it hard for individuals outside big companies to fully exploit their learning ability. More importantly, deep neural networks have too many hyper-parameters, and the learning performance depends seriously on careful tuning of them. For example, even when several authors all use convolutional neural networks [LeCun et al., 1998; Krizhevsky et al., 2012; Simonyan and Zisserman, 2014], they are actually using different learning models due to the many different options, such as the convolutional layer structures. This fact not only makes the training of deep neural networks very tricky, like an art rather than a science/engineering practice, but also makes theoretical analysis extremely difficult, because there are too many interfering factors with almost infinitely many configurational combinations.

It is widely recognized that the representation learning ability is crucial for deep neural networks. It is also noteworthy that, to exploit large training data, the capacity of the learning model should be large; this partially explains why deep neural networks are very complicated, much more complex than ordinary learning models such as support vector machines. We conjecture that if we can endow these properties to some other suitable form of learning model, we may be able to achieve performance competitive to deep neural networks, but with fewer of the aforementioned deficiencies.

In this paper, we propose gcForest (multi-Grained Cascade forest), a novel decision tree ensemble method. This method generates a deep forest ensemble with a cascade structure, which enables gcForest to do representation learning. Its representation learning ability can be further enhanced by multi-grained scanning when the inputs are of high dimensionality, potentially enabling gcForest to be contextually or structurally aware. The number of cascade levels can be adaptively determined, such that the model complexity is set automatically, enabling gcForest to perform excellently even on small-scale data. It is noteworthy that gcForest has far fewer hyper-parameters than deep neural networks; even better news is that its performance is quite robust to hyper-parameter settings, such that in most cases, even across different data from different domains, it is able to achieve excellent performance with the default settings. This not only makes gcForest convenient to train, but also makes theoretical analysis, although beyond the scope of this paper, easier than for deep neural networks (needless to say, tree learners are typically easier to analyze than neural networks). In our experiments, gcForest achieves performance highly competitive to, or even better than, that of deep neural networks, whereas the training time cost of gcForest running on a PC is comparable to that of deep neural networks running with GPU facilities. Note that the efficiency advantage can be even more apparent because gcForest is naturally apt to parallel implementation.

Figure 1: Illustration of the cascade forest structure. Each level of the cascade consists of two random forests (blue) and two completely-random tree forests (black). Suppose there are three classes to predict; thus, each forest will output a three-dimensional class vector, which is then concatenated for re-representation of the original input.
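To make the re-representation in Figure 1 concrete, the following is a minimal Python sketch of one cascade level and of the adaptive level growth, assuming scikit-learn. It is not the authors' reference implementation: ExtraTreesClassifier with max_features=1 only approximates a completely-random tree forest, the helper names (make_level_forests, fit_level, grow_cascade) and all hyper-parameter values are illustrative, and the use of out-of-fold class vectors is an assumption made here to limit overfitting of the augmented features.

```python
# A minimal sketch of the cascade in Figure 1, assuming scikit-learn.
# ExtraTreesClassifier with max_features=1 approximates a completely-random
# tree forest; all names and hyper-parameter values are illustrative.
import numpy as np
from sklearn.ensemble import ExtraTreesClassifier, RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import cross_val_predict

def make_level_forests(n_trees=100, seed=0):
    """One cascade level: two random forests and two (approximated)
    completely-random tree forests, as in Figure 1."""
    return [
        RandomForestClassifier(n_estimators=n_trees, random_state=seed),
        RandomForestClassifier(n_estimators=n_trees, random_state=seed + 1),
        ExtraTreesClassifier(n_estimators=n_trees, max_features=1,
                             random_state=seed + 2),
        ExtraTreesClassifier(n_estimators=n_trees, max_features=1,
                             random_state=seed + 3),
    ]

def fit_level(X_aug, y, forests, cv=3):
    """Fit one level and return its concatenated class vectors. Out-of-fold
    probabilities are used so the class vectors fed to the next level are not
    produced by trees that saw the same instances during training."""
    class_vectors = []
    for forest in forests:
        # With three classes, each forest contributes a three-dimensional
        # class vector per instance; four forests give twelve new features.
        class_vectors.append(cross_val_predict(forest, X_aug, y, cv=cv,
                                               method="predict_proba"))
        forest.fit(X_aug, y)  # refit on all data for predicting new inputs
    return np.hstack(class_vectors)

def grow_cascade(X_tr, y_tr, X_val, y_val, max_levels=10):
    """Add levels while held-out accuracy improves, so the number of cascade
    levels (the model complexity) is determined adaptively, not fixed."""
    tr_aug, val_aug, levels, best_acc = X_tr, X_val, [], 0.0
    for depth in range(max_levels):
        forests = make_level_forests(seed=depth)
        tr_vecs = fit_level(tr_aug, y_tr, forests)
        val_probas = [f.predict_proba(val_aug) for f in forests]
        # Prediction at a level: average the forests' class vectors and
        # take the class with the largest averaged probability.
        acc = accuracy_score(y_val, np.mean(val_probas, axis=0).argmax(axis=1))
        if acc <= best_acc:
            break                      # no improvement: stop growing
        levels.append(forests)
        best_acc = acc
        # Re-representation: concatenate class vectors with the raw input.
        tr_aug = np.hstack([X_tr, tr_vecs])
        val_aug = np.hstack([X_val, np.hstack(val_probas)])
    return levels, best_acc
```

Usage is simply levels, acc = grow_cascade(X_train, y_train, X_valid, y_valid) on numeric feature arrays. Stopping as soon as held-out accuracy stops improving is what lets the cascade set its own depth, which is the mechanism behind the automatic model-complexity control described above.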
We believe that to tackle complicated learning tasks, learning models will likely have to go deep. Current deep models, however, are always neural networks. This paper illustrates how to construct a deep forest, and it may open a door towards alternatives to deep neural networks for many tasks.

In the next sections we will introduce gcForest and report on experiments, followed by related work and the conclusion.

2 The Proposed Approach