Sequential Minimal Optimization: A Fast Algorithm for Training Support Vector Machines

John C. Platt
Microsoft Research
jplatt@microsoft.com

Technical Report MSR-TR-98-14
April 21, 1998
© 1998 John Platt

ABSTRACT

This paper proposes a new algorithm for training support vector machines: Sequential Minimal Optimization, or SMO. Training a support vector machine requires the solution of a very large quadratic programming (QP) optimization problem. SMO breaks this large QP problem into a series of smallest possible QP problems. These small QP problems are solved analytically, which avoids using a time-consuming numerical QP optimization as an inner loop. The amount of memory required for SMO is linear in the training set size, which allows SMO to handle very large training sets. Because matrix computation is avoided, SMO scales somewhere between linear and quadratic in the training set size for various test problems, while the standard chunking SVM algorithm scales somewhere between linear and cubic in the training set size. SMO's computation time is dominated by SVM evaluation, hence SMO is fastest for linear SVMs and sparse data sets. On real-world sparse data sets, SMO can be more than 1000 times faster than the chunking algorithm.

1. INTRODUCTION

In the last few years, there has been a surge of interest in Support Vector Machines (SVMs) [19][20][4]. SVMs have empirically been shown to give good generalization performance on a wide variety of problems such as handwritten character recognition [12], face detection [15], pedestrian detection [14], and text categorization [9].

However, the use of SVMs is still limited to a small group of researchers. One possible reason is that training algorithms for SVMs are slow, especially for large problems. Another explanation is that SVM training algorithms are complex, subtle, and difficult for an average engineer to implement.

This paper describes a new SVM learning algorithm that is conceptually simple, easy to implement, is generally faster, and has better scaling properties for difficult SVM problems than the standard SVM training algorithm. The new SVM learning algorithm is called Sequential Minimal Optimization (or SMO). Unlike previous SVM learning algorithms, which use numerical quadratic programming (QP) as an inner loop, SMO uses an analytic QP step.

This paper first provides an overview of SVMs and a review of current SVM training algorithms. The SMO algorithm is then presented in detail, including the solution to the analytic QP step, heuristics for choosing which variables to optimize in the inner loop, a description of how to set the threshold of the SVM, some optimizations for special cases, the pseudo-code of the algorithm, and the relationship of SMO to other algorithms. SMO has been tested on two real-world data sets and two artificial data sets. This paper presents the results for timing SMO versus the standard "chunking" algorithm for these data sets, and presents conclusions based on these timings. Finally, there is an appendix that describes the derivation of the analytic optimization.

1.1 Overview of Support Vector Machines

Vladimir Vapnik invented Support Vector Machines in 1979 [19]. In its simplest, linear form, an SVM is a hyperplane that separates a set of positive examples from a set of negative examples with maximum margin (see figure 1). In the linear case, the margin is defined by the distance of the hyperplane to the nearest of the positive and negative examples. The formula for the output of a linear SVM is

    u = \vec{w} \cdot \vec{x} - b,    (1)

where \vec{w} is the normal vector to the hyperplane and \vec{x} is the input vector. The separating hyperplane is the plane u = 0. The nearest points lie on the planes u = ±1. The margin m is thus

    m = \frac{1}{\|\vec{w}\|_2}.    (2)

Maximizing margin can be expressed via the following optimization problem [4]:

    \min_{\vec{w},b} \frac{1}{2}\|\vec{w}\|^2 \quad \text{subject to} \quad y_i(\vec{w} \cdot \vec{x}_i - b) \ge 1, \; \forall i,    (3)

[Figure 1: A linear Support Vector Machine. The figure shows the space of possible inputs, with the separating hyperplane maximizing the distances to the nearest positive and negative examples.]

where \vec{x}_i is the i-th training example, and y_i is the correct output of the SVM for the i-th training example. The value y_i is +1 for the positive examples in a class and -1 for the negative examples.

Using a Lagrangian, this optimization problem can be converted into a dual form which is a QP problem where the objective function \Psi is solely dependent on a set of Lagrange multipliers \alpha_i,

    \min_{\vec{\alpha}} \Psi(\vec{\alpha}) = \min_{\vec{\alpha}} \frac{1}{2} \sum_{i=1}^{N} \sum_{j=1}^{N} y_i y_j (\vec{x}_i \cdot \vec{x}_j) \alpha_i \alpha_j - \sum_{i=1}^{N} \alpha_i    (4)

(where N is the number of training examples), subject to the inequality constraints,

    \alpha_i \ge 0, \; \forall i,    (5)

and one linear equality constraint,

    \sum_{i=1}^{N} y_i \alpha_i = 0.    (6)

There is a one-to-one relationship between each Lagrange multiplier and each training example. Once the Lagrange multipliers are determined, the normal vector \vec{w} and the threshold b can be derived from the Lagrange multipliers:

    \vec{w} = \sum_{i=1}^{N} y_i \alpha_i \vec{x}_i, \qquad b = \vec{w} \cdot \vec{x}_k - y_k \text{ for some } \alpha_k > 0.    (7)

Because \vec{w} can be computed via equation (7) from the training data before use, the amount of computation required to evaluate a linear SVM is constant in the number of non-zero support vectors.

Of course, not all data sets are linearly separable. There may be no hyperplane that splits the positive examples from the negative examples. In the formulation above, the non-separable case would correspond to an infinite solution. However, in 1995, Cortes & Vapnik [7] suggested a modification to the original optimization statement (3) which allows, but penalizes, the failure of an example to reach the correct margin. That modification is:

    \min_{\vec{w},b,\vec{\xi}} \frac{1}{2}\|\vec{w}\|^2 + C \sum_{i=1}^{N} \xi_i \quad \text{subject to} \quad y_i(\vec{w} \cdot \vec{x}_i - b) \ge 1 - \xi_i, \; \forall i,    (8)

where \xi_i are slack variables