A-Neural-Algorithm-of-Artistic-Style

ANeuralAlgorithmofArtisticStyleLeonA.Gatys,1;2;3AlexanderS.Ecker,1;2;4;5MatthiasBethge1;2;41WernerReichardtCentreforIntegrativeNeuroscienceandInstituteofTheoreticalPhysics,UniversityofT¨ubingen,Germany2BernsteinCenterforComputationalNeuroscience,T¨ubingen,Germany3GraduateSchoolforNeuralInformationProcessing,T¨ubingen,Germany4MaxPlanckInstituteforBiologicalCybernetics,T¨ubingen,Germany5DepartmentofNeuroscience,BaylorCollegeofMedicine,Houston,TX,USATowhomcorrespondenceshouldbeaddressed;E-mail:leon.gatys@bethgelab.orgInﬁneart,especiallypainting,humanshavemasteredtheskilltocreateuniquevisualexperiencesthroughcomposingacomplexinterplaybetweenthecon-tentandstyleofanimage.Thusfarthealgorithmicbasisofthisprocessisunknownandthereexistsnoartiﬁcialsystemwithsimilarcapabilities.How-ever,inotherkeyareasofvisualperceptionsuchasobjectandfacerecognitionnear-humanperformancewasrecentlydemonstratedbyaclassofbiologicallyinspiredvisionmodelscalledDeepNeuralNetworks.1,2HereweintroduceanartiﬁcialsystembasedonaDeepNeuralNetworkthatcreatesartisticimagesofhighperceptualquality.Thesystemusesneuralrepresentationstosepa-rateandrecombinecontentandstyleofarbitraryimages,providinganeuralalgorithmforthecreationofartisticimages.Moreover,inlightofthestrik-ingsimilaritiesbetweenperformance-optimisedartiﬁcialneuralnetworksandbiologicalvision,3–7ourworkoffersapathforwardtoanalgorithmicunder-standingofhowhumanscreateandperceiveartisticimagery.1arXiv:1508.06576v1[cs.CV]26Aug2015TheclassofDeepNeuralNetworksthataremostpowerfulinimageprocessingtasksarecalledConvolutionalNeuralNetworks.ConvolutionalNeuralNetworksconsistoflayersofsmallcomputationalunitsthatprocessvisualinformationhierarchicallyinafeed-forwardman-ner(Fig1).Eachlayerofunitscanbeunderstoodasacollectionofimageﬁlters,eachofwhichextractsacertainfeaturefromtheinputimage.Thus,theoutputofagivenlayerconsistsofso-calledfeaturemaps:differentlyﬁlteredversionsoftheinputimage.WhenConvolutionalNeuralNetworksaretrainedonobjectrecognition,theydeveloparepresentationoftheimagethatmakesobjectinformationincreasinglyexplicitalongthepro-cessinghierarchy.8Therefore,alongtheprocessinghierarchyofthenetwork,theinputimageistransformedintorepresentationsthatincreasinglycareabouttheactualcontentoftheim-agecomparedtoitsdetailedpixelvalues.Wecandirectlyvisualisetheinformationeachlayercontainsabouttheinputimagebyreconstructingtheimageonlyfromthefeaturemapsinthatlayer9(Fig1,contentreconstructions,seeMethodsfordetailsonhowtoreconstructtheim-age).Higherlayersinthenetworkcapturethehigh-levelcontentintermsofobjectsandtheirarrangementintheinputimagebutdonotconstraintheexactpixelvaluesofthereconstruc-tion.(Fig1,contentreconstructionsd,e).Incontrast,reconstructionsfromthelowerlayerssimplyreproducetheexactpixelvaluesoftheoriginalimage(Fig1,contentreconstructionsa,b,c).Wethereforerefertothefeatureresponsesinhigherlayersofthenetworkasthecontentrepresentation.Toobtainarepresentationofthestyleofaninputimage,weuseafeaturespaceoriginallydesignedtocapturetextureinformation.8Thisfeaturespaceisbuiltontopoftheﬁlterresponsesineachlayerofthenetwork.Itconsistsofthecorrelationsbetweenthedifferentﬁlterresponsesoverthespatialextentofthefeaturemaps(seeMethodsfordetails).Byincludingthefeaturecorrelationsofmultiplelayers,weobtainastationary,multi-scalerepresentationoftheinputimage,whichcapturesitstextureinformationbutnottheglobalarrangement.2Figure1:ConvolutionalNeuralNetwork(CNN).AgiveninputimageisrepresentedasasetofﬁlteredimagesateachprocessingstageintheCNN.Whilethenumberofdifferentﬁltersincreasesalongtheprocessinghierarchy,thesizeoftheﬁlteredimagesisreducedbysomedownsamplingmechanism(e.g.max-pooling)leadingtoadecreaseinthetotalnumberofunitsperlayerofthenetwork.ContentReconstructions.WecanvisualisetheinformationatdifferentprocessingstagesintheCNNbyreconstructingtheinputimagefromonlyknow-ingthenetwork’sresponsesinaparticularlayer.Wereconstructtheinputimagefromfromlayers‘conv11’(a),‘conv21’(b),‘conv31’(c),‘conv41’(d)and‘conv51’(e)oftheorig-inalVGG-Network.Weﬁndthatreconstructionfromlowerlayersisalmostperfect(a,b,c).Inhigherlayersofthenetwork,detailedpixelinformationislostwhilethehigh-levelcontentoftheimageispreserved(d,e).StyleReconstructions.OntopoftheoriginalCNNrepresentationswebuiltanewfeaturespacethatcapturesthestyleofaninputimage.ThestylerepresentationcomputescorrelationsbetweenthedifferentfeaturesindifferentlayersoftheCNN.Werecon-structthestyleoftheinputimagefromstylerepresentationsbuiltondifferentsubsetsofCNNlayers(‘conv11’(a),‘conv11’and‘conv21’(b),‘conv11’,‘conv21’and‘conv31’(c),‘conv11’,‘conv21’,‘conv31’and‘conv41’(d),‘conv11’,‘conv21’,‘conv31’,‘conv41’and‘conv51’(e)).Thiscreatesimagesthatmatchthestyleofagivenimageonanincreasingscalewhilediscardinginformationoftheglobalarrangementofthescene.3Again,wecanvisualisetheinformationcapturedbythesestylefeaturespacesbuiltondifferentlayersofthenetworkbyconstructinganimagethatmatchesthestylerepresentationofagiveninputimage(Fig1,stylereconstructions).10,11Indeedreconstructionsfromthestylefeatur

A-Neural-Algorithm-of-Artistic-Style

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

电力市场衍生交易中的若干问题研究

XXXX广州市危化品企业档案电子化管理系统及相关设备采

电动给水泵液力耦合器基础知识

XXXX 岩土工程师基础考试大纲

内部质量审核检查表(XXXX3)

炒股支撑压力与切线理论

部门经理工作职责及岗位能力要求

模板新员工入职培训PPT

中小企业私募债融资业务简介M

幼儿园食堂食品安全月度自查表

相关文档

相关搜索

A-Neural-Algorithm-of-Artistic-Style

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

电力市场衍生交易中的若干问题研究

XXXX广州市危化品企业档案电子化管理系统及相关设备采

电动给水泵液力耦合器基础知识

XXXX 岩土工程师 基础 考试大纲

内部质量审核检查表(XXXX3)

炒股支撑压力与切线理论

部门经理工作职责及岗位能力要求

模板新员工入职培训PPT

中小企业私募债融资业务简介M

幼儿园食堂食品安全月度自查表

相关文档

相关搜索

XXXX 岩土工程师基础考试大纲