Deep-Convolutional-Network-for-Handwritten-Chinese

Deep Convolutional Network for Handwritten Chinese Character Recognition

Yuhao Zhang
Computer Science Department
Stanford University
zyh@stanford.edu

Abstract

In this project we explored the performance of deep convolutional neural networks on recognizing handwritten Chinese characters. We ran experiments on a 200-class and a 3755-class dataset using convolutional networks with different depths and filter numbers. Experimental results show that deeper networks with larger filter numbers give better test accuracy. We also provide a visualization of the learned network on the handwritten Chinese characters.

1. Introduction

Deep convolutional neural networks (CNNs) have been the architecture of choice for complex vision recognition problems for several years. There has been a lot of research on using deep CNNs to recognize handwritten digits, English letters, or the more general Latin alphabet. Experiments have shown that well-constructed deep CNNs are powerful tools for tackling these challenges. As the recognition of characters in various languages has attracted much attention in the research community, a natural question is: how do deep CNNs perform at recognizing more complex handwritten characters? In this project, we explore the power of deep CNNs on the classification of handwritten Chinese characters.

Compared to the task of recognizing handwritten digits and English letters, the recognition of handwritten Chinese characters is more challenging for several reasons. First, there are many more categories for Chinese characters than for digits and English characters. For comparison, there are 10 digits in the usual digit recognition tasks and 26 letters in the English alphabet, while there are over 50,000 Chinese characters in total, around 3,000 of which are in everyday use. Second, most Chinese characters have much more complicated structures and consist of many more strokes than digits or English characters. Figure 1 shows a comparison of different handwritten characters. Third, handwriting style for Chinese characters varies hugely from person to person. Moreover, the existence of joined-up handwriting makes recognition even more difficult. For example, Figure 2 shows the influence of different handwriting styles on the appearance of handwritten Chinese characters. It is challenging even for a well-educated Chinese reader to recognize all the handwritten characters correctly.

In this project, we focus on two specific questions: 1) How do the architecture and depth influence the accuracy of a CNN on recognizing handwritten Chinese characters? 2) Do the extracted features make sense in terms of visualization?

The rest of the report is organized as follows. We first introduce the dataset and our network configurations in Section 2 and Section 3. Then we describe how we implement and train our networks in Section 4. Afterwards we present our experimental results in Section 5 and analyze the results in Section 6. Finally, we discuss related work in Section 7.

Figure 1: Examples of handwritten characters: (a) Digit, (b) English, (c) Chinese.

2. Data

2.1. Dataset

For this project we use the CASIA offline database, as described in [6]. The data consists of plain gray-scale images of isolated handwritten Chinese characters, as shown in Figure 2. Specifically, we use the HWDB1.1 dataset, which includes 3,755 Chinese characters and 171 alphanumeric characters and symbols. As presented in Table 1, each category contains handwritten images from approximately 300 writers (with minor differences for some categories), and each writer contributes one image to each category. As released, the full dataset is split into two parts: a training set and a test set. The test set contains 60 randomly sampled images for each category, and the training set contains the rest (approximately 240). In this project, for debugging and comparing different models during training, we further split the original training set into two parts: a training set and a validation set, with the training set containing 200 images for each category and the validation set containing the rest (approximately 40).

Figure 2: Different data examples in the same category. Examples on each row come from different writers and correspond to the same Chinese character: 艾, 斌, and 棉 respectively. Very different handwriting styles across writers can be observed.

Dataset         #Writers   #Classes   #Total Samples   #Chinese characters   #Symbols
CASIA HWDB1.1   300        3,755      1,172,907        1,121,749             51,158

Table 1: HWDB1.1 Dataset Information

Training on the full dataset (over 1 million examples) can take many hours or days even with the fastest GPU. Constrained by the computational resources we have access to, we ran our major experiments (for model comparison and visualization) on a subset of the full dataset, which contains 200 randomly sampled classes. The sizes of the training set, validation set and test set for each class remain unchanged. We also ran experiments on the full dataset to evaluate the power of our best models. This full dataset contains all 3,755 classes. It is worth noting that there are many more classes than examples per class in the training set. Information about the two datasets is listed in Table 2.

2.2. Preprocessing

The released dataset contains examples in binary format, along with labels. So the first step of data processing is to convert the binary data into image format. Here we use .jpg to encode the images and store the image files. The converted images have the background labeled as 255 and foreground pixels in 255 gray levels (0-254).

As in [2], we followed a three-step preprocessing approach: resizing, contrast maximization and image mean subtraction. Given a raw input image describing a handwritten Chinese character, we first resized the image into a normalized s
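
The class subsampling and per-class training/validation split described in Section 2.1 can be sketched as follows. This is a minimal illustration and not the code used in the project: the function names `sample_classes` and `split_per_class`, the fixed random seed, and the toy sample list are assumptions made for the example. The only values taken from the text are the 200 training images per class and the idea of keeping a random subset of classes.

```python
import random
from collections import defaultdict


def sample_classes(labels, num_classes, seed=0):
    """Pick a random subset of class labels (hypothetical helper).

    In the report this would correspond to keeping 200 of the 3,755 classes.
    """
    rng = random.Random(seed)
    return sorted(rng.sample(sorted(set(labels)), num_classes))


def split_per_class(samples, train_per_class=200, seed=0):
    """Split (item, label) pairs into training and validation sets per class.

    For each class, the items are shuffled; the first `train_per_class`
    go to the training set and the remainder to the validation set.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for item, label in samples:
        by_class[label].append(item)

    train, val = [], []
    for label in sorted(by_class):
        items = by_class[label]
        rng.shuffle(items)
        train += [(x, label) for x in items[:train_per_class]]
        val += [(x, label) for x in items[train_per_class:]]
    return train, val


# Toy stand-in for the original training set: 5 classes, 240 images each.
toy = [(f"img_{c}_{i}", c) for c in range(5) for i in range(240)]

# Keep 3 randomly chosen classes, then split each class 200 / 40.
kept = sample_classes([c for _, c in toy], num_classes=3)
subset = [(x, c) for x, c in toy if c in kept]
train, val = split_per_class(subset)
print(len(train), len(val))  # 600 120
```

Splitting within each class, rather than over the pooled samples, keeps the per-class set sizes fixed, which matches the statement that the set sizes per class remain unchanged when moving between the 200-class subset and the full dataset.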
