An introduction to machine learning and probabilistic graphical models
Kevin Murphy, MIT AI Lab
Presented at Intel's workshop on "Machine learning for the life sciences", Berkeley, CA, 3 November 2003

Slide 2: Overview
- Supervised learning
- Unsupervised learning
- Graphical models
- Learning relational models
Thanks to Nir Friedman, Stuart Russell, Leslie Kaelbling and various web sources for letting me use many of their slides.

Slide 3: Supervised learning
Learn to approximate a function F(x1, x2, x3) -> t from a training set of (x, t) pairs, where the output t is yes (Y) or no (N):

  Color  Shape   Size   Output
  Blue   Torus   Big    Y
  Blue   Square  Small  Y
  Blue   Star    Small  Y
  Red    Arrow   Small  N

Slide 4: Supervised learning
Training data -> Learner -> Hypothesis; the hypothesis is then used to predict T on the testing data.

  Training data          Testing data
  X1  X2  X3  T          X1  X2  X3  T
  B   T   B   Y          B   A   S   ?
  B   S   S   Y          Y   C   S   ?
  B   S   S   Y
  R   A   S   N

  Prediction: T = Y, N

Slide 5: Key issue: generalization
We can't just memorize the training set (overfitting).

Slide 6: Hypothesis spaces
- Decision trees
- Neural networks
- K-nearest neighbors
- Naïve Bayes classifier
- Support vector machines (SVMs)
- Boosted decision stumps
- ...

Slide 7: Perceptron (neural net with no hidden layers)
Linearly separable data.

Slide 8: Which separating hyperplane?

Slide 9: The linear separator with the largest margin is the best one to pick.

Slide 10: What if the data is not linearly separable?

Slide 11: Kernel trick
A kernel implicitly maps the data from 2D (x1, x2) to 3D (z1, z2, z3), making the problem linearly separable; for the quadratic kernel k(x, y) = (x . y)^2, the implicit feature map is z = (x1^2, sqrt(2) x1 x2, x2^2).

Slide 12: Support Vector Machines (SVMs)
Two key ideas:
- Large margins
- Kernel trick

Slide 13: Boosting
Simple classifiers (weak learners) can have their performance boosted by taking weighted combinations. Boosting maximizes the margin.

Slide 14: Supervised learning success stories
- Face detection
- Steering an autonomous car across the US
- Detecting credit card fraud
- Medical diagnosis
- ...

Slide 15: Unsupervised learning
What if there are no output labels?

Slide 16: K-means clustering
1. Guess the number of clusters, K.
2. Guess initial cluster centers, μ1, μ2, ...
3. Assign data points xi to the nearest cluster center.
4. Re-compute the cluster centers based on the assignments.
Reiterate steps 3-4 (see the sketch below).
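A minimal K-means sketch in Python/NumPy, following the four steps above. The toy two-blob data, the choice K = 2, and the random-data-point initialization are illustrative assumptions, not details from the talk.

import numpy as np

def kmeans(X, K, n_iters=100, seed=0):
    rng = np.random.default_rng(seed)
    # Step 2: guess initial cluster centers by picking K random data points.
    centers = X[rng.choice(len(X), size=K, replace=False)]
    for _ in range(n_iters):
        # Step 3: assign each data point x_i to its nearest cluster center.
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        assign = dists.argmin(axis=1)
        # Step 4: re-compute each cluster center as the mean of its assigned points.
        new_centers = np.array([X[assign == k].mean(axis=0) if np.any(assign == k)
                                else centers[k] for k in range(K)])
        # Reiterate until the centers stop moving.
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers, assign

# Step 1: guess K = 2 for toy data drawn from two well-separated blobs.
X = np.vstack([np.random.randn(50, 2) + [0, 0],
               np.random.randn(50, 2) + [5, 5]])
centers, assign = kmeans(X, K=2)
print(centers)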
Slide 17: AutoClass (Cheeseman et al., 1986)
- EM algorithm for mixtures of Gaussians
- "Soft" version of K-means
- Uses a Bayesian criterion to select K
- Discovered new types of stars from spectral data
- Discovered new classes of proteins and introns from DNA/protein sequence databases

Slide 18: Hierarchical clustering

Slide 19: Principal Component Analysis (PCA)
PCA seeks a projection that best represents the data in a least-squares sense. PCA reduces the dimensionality of feature space by restricting attention to those directions along which the scatter of the cloud is greatest.

Slide 20: Discovering nonlinear manifolds

Slide 21: Combining supervised and unsupervised learning

Slide 22: Discovering rules (data mining)

  Occup.   Income  Educ.  Sex  Married  Age
  Student  $10k    MA     M    S        22
  Student  $20k    PhD    F    S        24
  Doctor   $80k    MD     M    M        30
  Retired  $30k    HS     F    M        60

Find the most frequent patterns (association rules), e.g.:
- num in household = 1 ^ num children = 0 => language = English
- language = English ^ income < $40k ^ married = false ^ num children = 0 => education in {college, grad school}

Slide 23: Unsupervised learning: summary
- Clustering
- Hierarchical clustering
- Linear dimensionality reduction (PCA)
- Non-linear dimensionality reduction
- Learning rules

Slide 24: Discovering networks?
From data visualization to causal discovery.

Slide 25: Networks in biology
Most processes in the cell are controlled by networks of interacting molecules:
- Metabolic networks
- Signal transduction networks
- Regulatory networks
Networks can be modeled at multiple levels of detail/realism (in order of decreasing detail):
- Molecular level
- Concentration level
- Qualitative level

Slide 26: Molecular level: Lysis-Lysogeny circuit in Lambda phage
Arkin et al. (1998), Genetics 149(4):1633-48.
5 genes, 67 parameters based on 50 years of research; the stochastic simulation required a supercomputer.

Slide 27: Concentration level: metabolic pathways
Usually modeled with differential equations.
[Figure: a network of genes g1-g5 connected by weighted edges such as w12, w23, w55.]

Slide 28: Qualitative level: Boolean networks

Slide 29: Probabilistic graphical models
- Support graph-based modeling at various levels of detail
- Models can be learned from noisy, partial data
- Can model "inherently" stochastic phenomena, e.g., molecular-level fluctuations ...
- But can also model deterministic, causal processes.

"The actual science of logic is conversant at present only with things either certain, impossible, or entirely doubtful. Therefore the true logic for this world is the calculus of probabilities." -- James Clerk Maxwell

"Probability theory is nothing but common sense reduced to calculation." -- Pierre Simon Laplace

Slide 30: Graphical models: outline
- What are graphical models?
- Inference
- Structure learning

Slide 31: Simple probabilistic model: linear regression
Y = α + βX + noise, where the line α + βX is the deterministic (functional) relationship between X and Y.

Slide 32: Simple probabilistic model: linear regression
Y = α + βX + noise. "Learning" = estimating the parameters α, β, σ from (x, y) pairs. They can be estimated by least squares:
  β = Σi (xi - x̄)(yi - ȳ) / Σi (xi - x̄)²,   α = ȳ - β x̄,
where x̄ is the empirical mean of the xi and σ² = (1/N) Σi (yi - α - β xi)² is the residual variance.

Slide 33: Piecewise linear regression
Latent "switch" variable: a hidden process at work.

Slide 34: Probabilistic graphical model for piecewise linear regression
[Graph: input X -> hidden Q, X -> output Y, Q -> Y]
- The hidden variable Q chooses which set of parameters to use for predicting Y.
- The value of Q depends on the value of the input X.
- This is an example of "mixtures of experts".
Learning is harder because Q is hidden, so we don't know which data points to assign to each line; this can be solved with EM (cf. K-means).

Slide 35: Classes of graphical models
Probabilistic models include graphical models, which split into directed models (Bayes nets, DBNs) and undirected models (MRFs).

Slide 36: Bayesian networks
Qualitative part: a directed acyclic graph (DAG); nodes are random variables, edges are direct influences. Example: Earthquake and Burglary are parents of Alarm, Alarm is the parent of Call, and Earthquake is the parent of Radio.
Quantitative part: a set of conditional probability distributions, one per family, e.g. P(A | E, B) for the family of Alarm:

  E   B    P(A=1)  P(A=0)
  e   b    0.9     0.1
  e   ¬b   0.2     0.8
  ¬e  b    0.9     0.1
  ¬e  ¬b   0.01    0.99

A Bayesian network is a compact representation of a probability distribution via conditional independence.
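A minimal sketch, in Python, of how the Alarm network's joint distribution factorizes over the DAG and how a query can be answered by brute-force enumeration. Only the P(A | E, B) table comes from the slide; the priors P(B) and P(E), the tables P(R | E) and P(C | A), and the query P(B=1 | C=1) are made-up illustrative numbers (assumptions), chosen only to show how the factorization is used.

from itertools import product

P_B = {1: 0.01, 0: 0.99}            # assumed prior on Burglary
P_E = {1: 0.02, 0: 0.98}            # assumed prior on Earthquake
P_A = {(1, 1): 0.9, (1, 0): 0.2,    # P(A=1 | E, B), keyed by (E, B) -- from the slide's table
       (0, 1): 0.9, (0, 0): 0.01}
P_R = {1: 0.8, 0: 0.1}              # assumed P(Radio=1 | E)
P_C = {1: 0.7, 0: 0.05}             # assumed P(Call=1 | A)

def joint(b, e, a, r, c):
    """P(B,E,A,R,C) = P(B) P(E) P(A|E,B) P(R|E) P(C|A): the DAG factorization."""
    pa = P_A[(e, b)] if a else 1 - P_A[(e, b)]
    pr = P_R[e] if r else 1 - P_R[e]
    pc = P_C[a] if c else 1 - P_C[a]
    return P_B[b] * P_E[e] * pa * pr * pc

# Inference by brute-force enumeration: P(Burglary=1 | Call=1).
num = sum(joint(1, e, a, r, 1) for e, a, r in product([0, 1], repeat=3))
den = sum(joint(b, e, a, r, 1) for b, e, a, r in product([0, 1], repeat=4))
print("P(B=1 | C=1) =", num / den)

Enumeration like this is exponential in the number of variables; the inference algorithms referred to in the outline on slide 30 exploit the graph structure to do better.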