您好,欢迎访问三七文档
当前位置:首页 > 商业/管理/HR > 信息化管理 > 基于卫生行业信息系统的数据仓库和数据挖掘设计
上海交通大学硕士学位论文基于卫生行业信息系统的数据仓库和数据挖掘设计姓名:石景明申请学位级别:硕士专业:软件工程指导教师:王英林;周浩20061201-91-200729-92-200729200729-3-10OLTPOLAP/DSSETLETLCWMXMLETLETLETL-4-AbstractInformatizationcourseofhomelandhygieneindustrydevelopedmorethan10years,andnowhasalreadygotbroadapplicationinmanybusinessbrancheswithinhygieneindustry.ButatpresentlotsofbusinesssystemsarestillinOLTPsystemstagewhichmainlyconcludesdailybusinessoperationsandbasedonrelational-typedatabase,andstilldonotruntoOLAP/DSSsystemstagewhichmainlyconcludesdataanalysisandbasedondatawarehouse.Theinformationcollectedbycomputercan'tbevaluableunlessbeminedefficiently.Thereforethepapersatisfiesindustryneed,andsufficientlyminesvalueofinformation,soastoservehygieneindustrybetter.Theresearchofthispaperrealizesmedicalhealthinformationdatamininganalysis,andhaschosenanactualhygienicresourcesdatabaseastheresearchobject.Thepapercompletelyrealizesallstagesfromdatawarehouseconstructing,datamining,todatapresentation.Themaincontentofwhichconcludesspecialdatawarehouseconstructing,ETLofdatawarehouseandapplyingdata-miningalgorithmstothemedicalhealthinformation.Constructingdatawarehousecoversentireprocessincludingtheconceptualmodeldesign,logicalmodeldesign,physicalmodeldesignanddatawarehousebuilding.ETLbasedonthestandardMetadatamodelCWM,designedanddevelopedstandardETLtoolsbasedonXMLdataexchangetechnology.Mainlyconsidersrequirementemphasisofhygienicinformationapplication,thepapermainlystudieddataanalysisanddataminingwhicharebasedontherelationrule,dataminingalgorithmswhichbasedongatheredkindofanalysisandsoon,andobtainedsignificantresearchresultsonhygienicmanagementespeciallyondiseasecontrol.Thepaperalsointroduceddatapresentationofthedataanalysisresults,andmainlyutilizessomeefficienttoolstorealizeit.Thepaper'ssignificanceliesinthattheprojectbasedonETLtechnologysonotonlysuccessfullyintegrateddistributedinformationintohygienicinformationdatawarehousewhichbegantotakeshape,butalsoappliedsomedataminingalgorithmstorealizethemedicalhealthinformationdataanalysisandmining,andthesemethodscanbeareferenceorinstructiontohealthmanagement,diseasecontrolandsocietyresidentshealthimprovement.KeywordsMedicalhealthinformationDatawarehourseETLDatamining-7-11.1,1.2200075ETL-8-ETL1.323ETLETLCWMXMLETL456-9-22.1]1[2.1.11??-10-??22.1.2-11-1232-1-12-2-1Table2-1Userinforrequirementtable12345……41648……1260300……11……8……1254CRUDCRUDCCreatRReadUUpdateDDelete]2[ERDERD]2[-13-CRUDCRUDCRUDCRUDCDCDCDUCRUDCRUD2-2CRUD2-2CRUDTable2-2RelationofEntityandfunctionCRUDmatrixCCRUDCRURRUDCRUDRUDRCCRUDCRURRUDCRUDRUDR53/-14-2-3Table2-3DatastoragemodeltableofoperationmanagersystemOracleSysbaseSQLServerVFPExcel2.1.3-15-2-1Fig2-1RelationgraphofbasetopicERD2-22.1.4ERD2-2-16-2-2Fig2-2Conceptmodelgraphofthreetopic1]3[2-12-1-17-2-32-3Fig2-3Starrinessmodelgraphofdiseasetopic22-4-18-2-4Fig2-4Snowmodelgraphofdiseasetopic2.22.2.11:XML2:?-19-???2.2.2IBMOracleSybaseMicrosoftSAS]4[IBMIBMBIVisualWarehouseVWEssbase/DB2OLAPServer5.0IBMDB2UDBBOSASVWEssbase/DB2OLAPServerEssbase/DB2OLAPServerROLAPRelationalOLAPROLAPMOLAPHOLAPEssbaseDB2UDBIBMBusinessObjectsBOLotusApproachCognosImpromptuIBMQueryManagementFacilityArborSoftwareEssbaseIBMArborDB2OLAPSASOracleORACLE10gORACLE10g8EOracle10gETL-20-ORACLESQLLoaderSQLSQLBIOLAPOraclePL/SQLXMLSQLIMPERTOEMORACLE10gXMLORACLE94G4G(8-128T)SQLORACLESybaseSybaseWarehouseStudioWarehouseArchitectPowerDesignerERPowerStageReplicationServerCarletonPASSPORTPowerStageSybaseAdaptiveServerEnterpriseSybaseAdaptiveServerIQAdaptiveServerIQRDBMS100AdaptiveServerIQMultiplexAdaptiveServerIQMultiplexAdaptiveServerIQ-21-SybaseIQWebDBASybaseIndustryWarehouseStudio(IWS)IWSIWSSybaseQuickStartDataMartQuickStartDataMartMicrosoftMicrosoftOLAPMicrosoftSQLServerCOMOLAPDTSDataTransformationServices/MicrosoftRepositoryMicrosoftRepositorySQLServerOLAPServicesPivotTableServicesOLAPVBPivotTableServicesMMCMicrosoftManagementConsoleMicrosoftOffice2000AccessExcelSQLServerSASSAS207090SAS30SAS/WAWarehouseAdministrator-22-SAS/MDDBSASSAS/AFSCLSAS/ITSVITServiceVisionITITWebSASHTMLPDFWeb,WebWebMicrosoftExcelSASWebWebSASORACLEORACLE10GSybasePowerDesignerMicrosoftDTSDataTransformationServices2.31:-23-2-52-5Table2-5Detaileddescribeoftopicfield2:]5[:-24--lshvarchar(50)xhSmallintjbdmvarchar(10)jbmcvarchar(60)hbrsInttjjsInthbldecimal(8,2)gcbdecimal(8,2)tjndvarchar(10)ksrqDatetimejsrqDatetimetjqyvarchar(60)tjrqvarchar(50)-lshvarchar(50)xhSmallintjgdmvarchar(10)jgmcvarchar(60)hbs1Inthbl1decimal(8,2)hbs2IntHbl2decimal(8,2)Hbs3IntHbl3decimal(8,2)Hbs4IntHbl4decimal(8,2)Hbs5IntHbl5decimal(8,2)Hbs6Int-25-hbs10decimal(8,2)hjhbsInthjhbldecimal(8,2)tjndvarchar(10)ksrqDatetimejsrqDatetimetjqyvarchar(60)tjrqvarchar(50)-lshvarchar(50)xhSmallintmzdmvarchar(10)mzmcvarchar(60)hbs1Inthbl1decimal(8,2)hbs2IntHbl2decimal(8,2)Hbs3IntHbl3decimal(8,2)Hbs4IntHbl4decimal(8,2)Hbs5IntHbl5decimal(8,2)Hbs6Inthbs10decimal(8,2)hjhbsInthjhbldecimal(8,2)tjndvarchar(10)ksrqDatetimejsrqDatetimetjqyvarchar(60)tjrqvarchar(50)3:-26-()4(3NFThirdNormalForm)(Star-Schema)]6[(Normalize)]7[:123-27-(FactTable)(DimensionTable)(
本文标题:基于卫生行业信息系统的数据仓库和数据挖掘设计
链接地址:https://www.777doc.com/doc-4291000 .html