Gradient-based-learning-applied-to-document-recogn

Gradient-BasedLearningAppliedtoDocumentRecognitionYANNLECUN,MEMBER,IEEE,L´EONBOTTOU,YOSHUABENGIO,ANDPATRICKHAFFNERInvitedPaperMultilayerneuralnetworkstrainedwiththeback-propagationalgorithmconstitutethebestexampleofasuccessfulgradient-basedlearningtechnique.Givenanappropriatenetworkarchitecture,gradient-basedlearningalgorithmscanbeusedtosynthesizeacomplexdecisionsurfacethatcanclassifyhigh-dimensionalpatterns,suchashandwrittencharacters,withminimalpreprocessing.Thispaperreviewsvariousmethodsappliedtohandwrittencharacterrecognitionandcomparesthemonastandardhandwrittendigitrecognitiontask.Convolutionalneuralnetworks,whicharespeciﬁcallydesignedtodealwiththevariabilityoftwodimensional(2-D)shapes,areshowntooutperformallothertechniques.Real-lifedocumentrecognitionsystemsarecomposedofmultiplemodulesincludingﬁeldextraction,segmentation,recognition,andlanguagemodeling.Anewlearningparadigm,calledgraphtransformernetworks(GTN’s),allowssuchmultimodulesystemstobetrainedgloballyusinggradient-basedmethodssoastominimizeanoverallperformancemeasure.Twosystemsforonlinehandwritingrecognitionaredescribed.Experimentsdemonstratetheadvantageofglobaltraining,andtheﬂexibilityofgraphtransformernetworks.Agraphtransformernetworkforreadingabankcheckisalsodescribed.Itusesconvolutionalneuralnetworkcharacterrecognizerscombinedwithglobaltrainingtechniquestoproviderecordaccuracyonbusinessandpersonalchecks.Itisdeployedcommerciallyandreadsseveralmillionchecksperday.Keywords—Convolutionalneuralnetworks,documentrecog-nition,ﬁnitestatetransducers,gradient-basedlearning,graphtransformernetworks,machinelearning,neuralnetworks,opticalcharacterrecognition(OCR).NOMENCLATUREGTGraphtransformer.GTNGraphtransformernetwork.HMMHiddenMarkovmodel.HOSHeuristicoversegmentation.K-NNK-nearestneighbor.ManuscriptreceivedNovember1,1997;revisedApril17,1998.Y.LeCun,L.Bottou,andP.HaffnerarewiththeSpeechandImageProcessingServicesResearchLaboratory,AT&TLabs-Research,RedBank,NJ07701USA.Y.BengioiswiththeD´epartementd’InformatiqueetdeRechercheOp´erationelle,Universit´edeMontr´eal,Montr´eal,Qu´ebecH3C3J7Canada.PublisherItemIdentiﬁerS0018-9219(98)07863-3.NNNeuralnetwork.OCROpticalcharacterrecognition.PCAPrincipalcomponentanalysis.RBFRadialbasisfunction.RS-SVMReduced-setsupportvectormethod.SDNNSpacedisplacementneuralnetwork.SVMSupportvectormethod.TDNNTimedelayneuralnetwork.V-SVMVirtualsupportvectormethod.I.INTRODUCTIONOverthelastseveralyears,machinelearningtechniques,particularlywhenappliedtoNN’s,haveplayedanincreas-inglyimportantroleinthedesignofpatternrecognitionsystems.Infact,itcouldbearguedthattheavailabilityoflearningtechniqueshasbeenacrucialfactorintherecentsuccessofpatternrecognitionapplicationssuchascontinuousspeechrecognitionandhandwritingrecognition.Themainmessageofthispaperisthatbetterpatternrecognitionsystemscanbebuiltbyrelyingmoreonauto-maticlearningandlessonhand-designedheuristics.Thisismadepossiblebyrecentprogressinmachinelearningandcomputertechnology.Usingcharacterrecognitionasacasestudy,weshowthathand-craftedfeatureextractioncanbeadvantageouslyreplacedbycarefullydesignedlearningmachinesthatoperatedirectlyonpixelimages.Usingdocumentunderstandingasacasestudy,weshowthatthetraditionalwayofbuildingrecognitionsystemsbymanuallyintegratingindividuallydesignedmodulescanbereplacedbyauniﬁedandwell-principleddesignparadigm,calledGTN’s,whichallowstrainingallthemodulestooptimizeaglobalperformancecriterion.Sincetheearlydaysofpatternrecognitionithasbeenknownthatthevariabilityandrichnessofnaturaldata,beitspeech,glyphs,orothertypesofpatterns,makeitalmostimpossibletobuildanaccuraterecognitionsystementirelybyhand.Consequently,mostpatternrecognitionsystemsarebuiltusingacombinationofautomaticlearningtechniquesandhand-craftedalgorithms.Theusualmethod0018–9219/98$10.001998IEEE2278PROCEEDINGSOFTHEIEEE,VOL.86,NO.11,NOVEMBER1998Fig.1.Traditionalpatternrecognitionisperformedwithtwomodules:aﬁxedfeatureextractorandatrainableclassiﬁer.ofrecognizingindividualpatternsconsistsindividingthesystemintotwomainmodulesshowninFig.1.Theﬁrstmodule,calledthefeatureextractor,transformstheinputpatternssothattheycanberepresentedbylow-dimensionalvectorsorshortstringsofsymbolsthat:1)canbeeasilymatchedorcomparedand2)arerelativelyinvariantwithrespecttotransformationsanddistortionsoftheinputpat-ternsthatdonotchangetheirnature.Thefeatureextractorcontainsmostofthepriorknowledgeandisratherspeciﬁctothetask.Itisalsothefocusofmostofthedesigneffort,becauseitisoftenentirelyhandcrafted.Theclassiﬁer,ontheotherhand,isoftengeneralpurposeandtrainable.Oneofthemainproblemswiththisapproachisthattherecognitionaccuracyislargelydeterminedbytheabilityofthedesignertocomeupwithanappropriatesetoffeatures.Thisturnsouttobeadauntingtaskwhich,unfortunately,mustberedoneforeachnewproblem.Alargeamountofthepatternrecognitionliteratureisdevotedtodescribingandcomparingtherelativemeritsofdifferentfeaturesetsforparticulartasks.Historically,theneedforappropriatefeatureextractorswasduetothefactthatthelearningtechniquesusedbytheclassiﬁerswerelimited

Gradient-based-learning-applied-to-document-recogn

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

第4章建筑给水

建筑构造_室内装修构造

江苏工程资料规范宣贯教材

化工专业国际认证的实践与思考

3通信管道工程施工及验收规范

经济寒冬考验企业成本功夫

葡萄与葡萄酒中黄烷醇类多酚和果实原花色素合成相关酶

软件企业技术合同登记及相关优惠政策

城市路灯照明节能技术现状与发展趋势

从石化技术开发案例探索自主创新之路

相关文档

相关搜索