Learning Bayesian Networks From Data An Efficient

1LearningBayesianNetworksfromData:AnEfficientApproachBasedonInformationTheoryJieChengDept.ofComputingScienceUniversityofAlbertaAlberta,T6G2H1Email:jcheng@cs.ualberta.caDavidBell,WeiruLiuFacultyofInformatics,UniversityofUlster,UKBT370QBEmail:{w.liu,da.bell}@ulst.ac.ukAbstractThispaperaddressestheproblemoflearningBayesiannetworkstructuresfromdatabyusinganinformationtheoreticdependencyanalysisapproach.Basedonourthree-phaseconstructionmechanism,twoefficientalgorithmshavebeendeveloped.Oneofouralgorithmsdealswithaspecialcasewherethenodeorderingisgiven,thealgorithmonlyrequire)(2NOCItestsandiscorrectgiventhattheunderlyingmodelisDAG-Faithful[Spirteset.al.,1996].Theotheralgorithmdealswiththegeneralcaseandrequires)(4NOconditionalindependence(CI)tests.ItiscorrectgiventhattheunderlyingmodelismonotoneDAG-Faithful(seeSection4.4).AsystembasedonthesealgorithmshasbeendevelopedanddistributedthroughtheInternet.Theempiricalresultsshowthatourapproachisefficientandreliable.1IntroductionTheBayesiannetworkisapowerfulknowledgerepresentationandreasoningtoolunderconditionsofuncertainty.ABayesiannetworkisadirectedacyclicgraph(DAG)withaprobabilitytableforeachnode.ThenodesinaBayesiannetworkrepresentpropositionalvariablesinadomain,andthearcsbetweennodesrepresentthedependencyrelationshipsamongthevariables.OnconstructingBayesiannetworksfromdatabases,weusenodestorepresentdatabaseattributes.Inrecentyears,manyBayesiannetworkstructurelearningalgorithmshavebeendeveloped.Thesealgorithmsgenerallyfallintotwogroups,search&scoringbasedalgorithmsanddependencyanalysisbasedalgorithms.AnoverviewofthesealgorithmsispresentedinSection6.Althoughsomeofthesealgorithmscangivegoodresultsonsomebenchmarkdatasets,therearestillseveralproblems:•Nodeorderingrequirement.Alotofpreviousworkassumesthatnodeorderingisavailable.Unfortunately,inmanytimesthisisnotthecase.•Lackofefficiency.Somerecentalgorithmsdonotneednodeordering,buttheyaregenerallynotveryefficient.AllpracticabledependencyanalysisbasedalgorithmsrequireexponentialnumbersofCItests.•Lackofpubliclyavailablelearningtools.Althoughtherearemanyalgorithmsforthistask,onlyafewBayesiannetworklearningsystemsarepubliclyavailableandonlyoneofthem(TETRADII[Spirtes,et.al.,1996])canbeappliedtoreal-worlddataminingapplicationswherethedatasetsoftenhavehundredsofvariablesandmillionsofrecords.ThismotivatesustodoresearchintheareaofBayesiannetworkstructurelearning.Wedevelopedtwoalgorithmsforthistask,AlgorithmAandAlgorithmB.AlgorithmAdealswithaspecialcasewherethenodeorderingisgiven,whichrequires)(2NOCItestsandiscorrectgiventhattheunderlyingmodelisDAGfaithful.AlgorithmBdealswiththegeneralcaseandrequires)(4NOCItests.ItiscorrectgiventhattheunderlyingmodelismonotoneDAGfaithful.Basedonthesetwoalgorithms,wehavedevelopedaBayesiannetworklearningsystem,calledBayesianNetworkPowerConstructor.ThesystemhasbeenavailableontheInternetsinceOctober1997andhasalreadyenjoyedoveronethousanddownloads.Duetotheveryencouragingexperimentalresultsandpositivefeedbackfromotherusers,weplantoexpandourworktoallowcontinuousandhiddenvariablesintheunderlyingmodels.Wealsoplantodevelopacommercialversionofoursystemandintegrateittolargedatamining,knowledgebaseanddecisionsupportsystems.2Theremainderofthepaperisorganizedasfollows.Section2introducesBayesiannetworklearningfromaninformationtheoreticperspective.InSection3wepresentaBayesiannetworklearningalgorithm(AlgorithmA)foraspecialcasewhennodeorderingisgiven.Thecorrectnessproofandcomplexityanalysisarealsogiven.Section4presentsanextensionofAlgorithmA(calledAlgorithmB),whichdoesnotrequirenodeordering.Then,wegiveitscorrectnessproofandcomplexityanalysis.InSection5,wefirstintroduceourBayesiannetworklearningsystem(BNPowerConstructor),whichimplementsbothofouralgorithms.Then,weanalyzetheexperimentalresultsofbothalgorithmsonreal-worlddatasets.Section6surveysthealgorithmsofBayesiannetworklearningalgorithms.Finally,weconcludeourworkandproposesomefutureresearchdirectionsinSection7.2LearningBayesianNetworksUsingInformationTheoryInthissection,wefirstgivesomebasicconceptsrelatedwithBayesiannetworklearning.Thenweintroduceavitalconceptusedinourapproach-d-separation,fromaninformationtheoreticperspective.2.1BasicConceptsDefinition2.1.AdirectedgraphGcanbedefinedasanorderedpairthatconsistsofafinitesetVofnodesandanirreflexiveadjacencyrelationEonV.ThegraphGisdenotedas),(EV.ForeachEyx∈),(wesaythatthereisanarc(directededge)fromnodextonodey.Inthegraph,thisisdenotedbyanarrowfromxtoyandxandyarecalledthestartpointandtheendpointofthearrowrespectively.Wealsosaythatnodexandnodeyareadjacentorxandyareneighborsofeachother.xisalsocalledaparentofyandyiscalledachildofx.Byusingtheconceptsofparentandchildrecursively,wecanalsodefinetheconceptofancestoranddescendent.Wealsocallanodethatdoesnothaveanyparentarootnode.ByirreflexiveadjacencyrelationwemeanthatforanyVx∈,Exx∉),(,i.e.,anarccannothaveanodeasbothitsstartpointandendpoint.Definition2.2.InBayesiannetworklearning,weoftenneedtofindapaththatconnectstwonodeswithoutconsideringthedirectionalityoftheedgesonthepath.Todistin

Learning Bayesian Networks From Data An Efficient

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

中盈福汇房地产开发有限公司清韵百园产品规划建议

机械控制工程基础(chp2)

建筑工程施工技术资料的整理

深入开展“小金库”治理工作

业务流程4R管理模式

中国全部上市公司01-04年重要财务指标

胸部CT十大征象诊断应用

村两委培训班上动员讲话

建筑工程资料管理用表

招聘流程图(1)

相关文档

相关搜索