LEARNING TO CONTROL pH PROCESSES AT MULTIPLE TIME

LEARNING TO CONTROL pH PROCESSES AT MULTIPLE TIME SCALES: PERFORMANCE ASSESSMENT IN A LABORATORY PLANT

S. Syafiie¹, F. Tadeo¹ and E. Martinez²

¹ Department of Systems Engineering and Automatic Control, Science Faculty, University of Valladolid. Prado de la Magdalena s/n., 47011 Valladolid, Spain. Email: {syam,fernando}@autom.uva.es
² Consejo Nacional de Investigaciones Científicas y Técnicas, Avellaneda 3657, 3000, Santa Fe, Argentina. Email: ecmarti@ceride.gov.ar

ABSTRACT

This article presents a solution to pH control based on model-free learning control (MFLC). The MFLC technique is proposed because the algorithm gives a general solution for acid-base systems, yet is simple enough for implementation in existing control hardware. MFLC is based on reinforcement learning (RL), which is learning by direct interaction with the environment. The MFLC algorithm is model-free and satisfies incremental control, input and output constraints. A novel solution of MFLC using multi-step actions (MSA) is presented: actions on multiple time scales consist of several identical primitive actions. This solves the problem of determining a suitable fixed time scale to select control actions so as to trade off accuracy in control against learning complexity. An application of MFLC to a pH process at laboratory scale is presented, showing that the proposed MFLC learns to control the neutralization process adequately and maintains the process in the goal band. Also, the MFLC controller smoothly manipulates the control signal.

KEYWORDS: learning control, goal seeking control, intelligent control, online learning, pH control, process control, neutralization process

1. INTRODUCTION

Control of pH in neutralization processes is a ubiquitous problem encountered in the chemical and biotechnological industries. For example, the pH value is controlled in chemical processes such as fermentation, precipitation, oxidation, flotation and solvent extraction. Controlling the pH is also an important issue in food and beverage production, such as bread, liquor, beer, soy sauce, cheese and milk production, because the enzymatic reactions are affected by the pH value of the process and each has its optimum pH, which is critical to the yield. Another industrial pH control application is in the decomposition section of the Sunoco/UOP Phenol Process. The acid catalyst that is added in the decomposition section must be neutralized to prevent yield loss due to side reactions and to protect against corrosion in the fractionation section (Schmidt, 2005).

In most pH neutralization processes the control of pH is not only a control problem but also involves chemical equilibrium, kinetic, thermodynamic and mixing problems. These characteristics must be considered in designing a controller (Gustafsson et al., 1995), and they make pH processes an interesting and challenging problem for researchers. An important problem is that the process buffer capacity varies with time in an unknown way and dramatically changes the process gain, which makes it difficult to design a controller. For example, if either the concentration in the inlet stream or the composition of the feed changes, the shape of the titration curve will be drastically altered. This means that the process nonlinearity becomes time-varying and the system moves among several titration curves. Also, due to the nonlinear dependence of the pH value on the amount of titrated reactant, the process is inherently nonlinear. Therefore, it is difficult to develop an appropriate mathematical model of the pH process for designing a well-performing controller.

Many strategies based on intelligent control have been proposed by different researchers, applying a wide array of techniques such as fuzzy control, neural networks or different combinations of intelligent and model-based methods. For example, fuzzy logic (Fuente et al., 2006) and neural networks (Ramirez and Jackson, 1999) have been implemented for pH control. Fuzzy self-tuning PI control (Babuska et al., 2002) and fuzzy internal model control (Edgar and Postlethwaite, 2000) have also been implemented to control pH processes.

The approaches cited above present several difficulties for practical applications and for control system design. The resulting control structures are complex and difficult to supervise. They might be conservative or may have many tuning parameters. Thus, tight and robust pH control is often difficult to achieve due to the inherently uncertain, nonlinear and time-varying characteristics of pH neutralization processes.

This paper discusses an alternative approach to solve the pH control problem by applying Model-Free Learning Control (MFLC) (Syafiie et al., 2004; 2005; 2006a; 2006b), based on the reinforcement learning framework (Sutton and Barto, 1998). Standard RL algorithms, like Q-learning, pose a difficulty for process control implementations: they scale very badly with increasing problem size and with the granularity of states or control actions. Among others, one intuitive reason for this is that the number of decisions from the start state to the goal state increases exponentially. To keep the number of decisions needed to reach the goal state tractable as the problem size grows, hierarchical approaches based on temporal abstraction have been proposed. Temporal abstraction can be defined as an explicit representation of extended actions, as policies together with a termination condition (Precup, 2000). The original one-step action is called a primitive action. The theory of Semi-Markov Decision Processes (SMDPs) is used to deal with temporal abstraction as a minimal extension of the RL framework. An SMDP is a Markov Decision Process (MDP) appropriate for modeling continuous-time discrete-event sys
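
The idea of a multi-step action, in which one primitive action is repeated several times and then credited through an SMDP-style update, can be illustrated with a small sketch. This is not the authors' MFLC algorithm: it is the generic SMDP Q-learning update from the temporal-abstraction literature, and every name in it (`run_multi_step_action`, `smdp_q_update`, `toy_step`, the integer state and the goal band `|state| <= 1`) is a hypothetical choice made for illustration only.

```python
from collections import defaultdict


def run_multi_step_action(step, state, action, n_repeats, gamma):
    """Repeat one primitive action up to n_repeats times.

    Returns the discounted return collected on the way, the final state,
    the number of primitive steps actually taken, and the termination flag.
    """
    total, discount, n_done, done = 0.0, 1.0, 0, False
    for _ in range(n_repeats):
        state, reward, done = step(state, action)
        total += discount * reward
        discount *= gamma
        n_done += 1
        if done:  # stop early once the (assumed) goal band is reached
            break
    return total, state, n_done, done


def smdp_q_update(q, state, option, ret, next_state, n_steps, options, alpha, gamma):
    """Generic SMDP Q-learning update for an extended action (option).

    The bootstrap term is discounted by gamma ** n_steps because the
    option lasted n_steps primitive time steps.
    """
    best_next = max(q[(next_state, o)] for o in options)
    target = ret + (gamma ** n_steps) * best_next
    q[(state, option)] += alpha * (target - q[(state, option)])


def toy_step(state, action):
    """Hypothetical plant: integer state, goal band |state| <= 1."""
    new_state = state + action
    in_band = abs(new_state) <= 1
    return new_state, (0.0 if in_band else -1.0), in_band


# Options are (primitive action, number of repeats) pairs.
OPTIONS = [(-1, 1), (-1, 3), (1, 1), (1, 3)]

q = defaultdict(float)
ret, s_next, n, done = run_multi_step_action(toy_step, 5, -1, 3, gamma=0.9)
smdp_q_update(q, 5, (-1, 3), ret, s_next, n, OPTIONS, alpha=0.5, gamma=0.9)
```

Treating each (action, repeat count) pair as a separate option lets the learner pick long actions far from the goal band and short ones near it, which is the trade-off between control accuracy and learning complexity described in the abstract.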
