Abstract

The business continuity assurance framework for the NG-CRM/BOSS system of China Mobile Group Tianjin Co. currently has three modes. The first is multi-node load sharing, used mainly in the system access layer and the business logic layer, which effectively reduces the impact of individual node failures on services. The second is disaster recovery; because the disaster recovery system has not been upgraded for many years, its resources no longer match those of the production center, and in an emergency it cannot restore critical business functions, in whole or in part, within the required time. The third is dual-machine backup with shared storage (hereafter "local HA"), used mainly in the system core layer.

Relying on local HA to guarantee business continuity in the core layer carries the following risks:

1) Because the core system has a heavy IO load, a serious failure such as a single-node crash may leave IO not yet written to disk, causing file system errors that prevent the standby machine from starting.

2) Data corruption caused by human factors, database logic errors, or storage failures can interrupt business, and local HA cannot resolve it. All NG-CRM/BOSS services are required to run 7×24, which greatly increases the intensity of storage array use and leaves no time for regular repair and maintenance of the storage system. As a result, after a period of use, the likelihood that storage components fail in succession or simultaneously increases. In addition, as storage systems become more capable and performant, their internal control software grows increasingly complex, much like an operating system, and can itself have faults or vulnerabilities. Some provincial companies have experienced major incidents in which storage failures caused prolonged business system downtime and data loss.

3) During system cutover, platform hardware or software maintenance, or application version upgrades, local HA may be unable to meet business continuity requirements.

4) If the production machine room suffers a fire, flooding, or a similar event, neither multi-node load sharing nor local HA can guarantee business continuity.

This thesis studies these risks and problems from the perspectives of the emergency system's architecture, construction and implementation, and system testing, and addresses each in turn.

Keywords: business support system; NG-CRM/BOSS; emergency system; telecom operators

Contents

Chapter 1 Introduction
1.1 Research Background
1.2 Research Purpose and Significance
1.3 Main Research Content and Thesis Structure
Chapter 2 Current-State Analysis of the Tianjin Mobile Business Support System and Emergency Construction Requirements
2.1 System Status and Risk Analysis
2.1.1 Functional Status
2.1.2 Hardware and Software Configuration Status
2.1.3 Network Organization Status
2.1.4 Risk Analysis
2.1.5 Risk Response Measures
2.2 Emergency Construction Requirements
2.2.1 Business Scope
2.2.2 Takeover Time Requirements
2.2.3 Emergency Data Synchronization
2.2.4 Emergency Data Switchback
2.2.5 Emergency System Management Functions
Chapter 3 Technical Research for the Tianjin Mobile Business Support Emergency System
3.1 Continuous Data Protection (CDP) Technology
Title: Design and Implementation of the Tianjin Mobile Business Support Emergency System (天津移动业务支撑应急系统设计与实现)
Link: https://www.777doc.com/doc-1639920.html