Learning with Limited Visibility

Eli Dichterman
Department of Mathematics, London School of Economics, Houghton Street, London WC2A 2AE, UK.
And, Department of Computer Science, Royal Holloway University of London, Egham, Surrey TW20 0EX, UK.
E-mail: eli@cdam.lse.ac.uk

Abstract

This paper surveys recent studies of learning problems in which the learner faces restrictions on the amount of information he can extract from each example he encounters. Our main framework for the analysis of such scenarios is the RFA (Restricted Focus of Attention) model. While being a natural refinement of the PAC learning model, some of the fundamental PAC-learning results and techniques fail in the RFA paradigm; learnability in the RFA model is no longer characterized by the VC dimension, and many PAC learning algorithms are not applicable in the RFA setting. Hence, the RFA formulation reflects the need for new techniques and tools to cope with some fundamental constraints of realistic learning problems. We also present some paradigms and algorithms that may serve as a first step towards answering this need.

Two main types of restrictions can be considered in the general RFA setting: In the more stringent one, called k-RFA, only k of the n attributes of each example are revealed to the learner, while in the more permissive one, called k-wRFA, the restriction is made on the size of each observation (k bits), and no restriction is made on how the observations are extracted from the examples.

We show an information-theoretic characterization of RFA learnability upon which we build a general tool for proving hardness results. We then apply this and other new techniques for studying RFA learning to two particularly expressive function classes, k-decision-lists (k-DL) and k-TOP, the class of thresholds of parity functions in which each parity function takes at most k inputs. Among other results, we show a hardness result for k-RFA learnability of k-DL, k ≤ n − 2. In sharp contrast, an (n−1)-RFA algorithm for learning (n−1)-DL is presented. Similarly, we prove that 1-DL is learnable if and only if at least half of the inputs are visible in each instance. In addition, we show that there is a uniform-distribution k-RFA learning algorithm for the class of k-DL. For k-TOP we show weak learnability by a k-RFA algorithm (with efficient time and sample complexity for constant k) and strong uniform-distribution k-RFA learnability of k-TOP with efficient sample complexity for constant k. Finally, by combining some of our k-DL and k-TOP results, we show that, unlike the PAC model, weak learning does not imply strong learning in the k-RFA model.

We also show a general technique for composing efficient k-RFA algorithms, and apply it to deduce, for instance, the efficient k-RFA learnability of k-DNF formulas, and the efficient 1-RFA learnability of axis-aligned rectangles in the Euclidean space R^n. We also prove the k-RFA learnability of richer classes of Boolean functions (such as k-decision lists) with respect to a given distribution, and the efficient (n−1)-RFA learnability (for fixed n), under product distributions, of classes of subsets of R^n which are defined by mild surfaces.

For the k-wRFA restriction, we show that for k = O(log n), efficient k-wRFA learning is robust against classification noise. As a straightforward application, we obtain a new simple noise-tolerant algorithm for the class of k-decision lists, by constructing an intuitive k-wRFA algorithm for this task.

1 Introduction

Learning theory has been mainly concerned with the problem of generalizing from a sample of fully-specified classified examples. For this problem, classical statistical uniform convergence theorems have been used to characterize scenarios in which a good generalization can be found with high confidence [42], specific bounds on the sample size needed for such generalization have been proven [14], and efficient learning algorithms have been designed for specific cases (cf. [41]).

It has also been noticed that in many realistic scenarios, the samples from which the learner has to generalize are not fully specified [28, 30]. The importance of this "focus-of-attention" problem has been noticed since the emergence of computational learning theory [1]. However, the learning models which have been formulated for studying this type of problem usually assume (sometimes implicitly [13]) that there is a fixed set of relevant variables which are invisible to the learner. In such problems, the learner may only attempt to find a good probabilistic prediction rule with respect to the visible attributes. However, as observed by Ben-David and Dichterman [6], there are many cases in which there are no attributes which are inherently invisible, but rather there are other restrictions on the visibility of the attributes, such as the number of visible attributes in each single example. Since in such cases every attribute is potentially visible, the learner may attempt to find more than just a probabilistic prediction rule; he may try to formulate a full description of the concept with respect to all the relevant attributes.

Consider, for instance, medical research which aims at forming the exact pattern of some disease. Typically, there is some a priori knowledge about the disease, such as the potentially relevant attributes of the disease and the possible patterns of the disease with respect to these attributes. Then, in the course of studying the disease, it is usually possible to sample people from a given population and conduct several tests on each one of them. However, due to practical considerations (e.g., the cost of the tests), or inherent restrictions (e.g., the fact that some blood tests may be destructive, or may not be usable for more than a limited number of tests), the amount of data that is available for each single person is limited. In such circumstances, researchers face the following problem: They can choose a set o
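To make the k-RFA observation protocol from the abstract concrete, the following is a minimal illustrative sketch, not taken from the paper and written in Python purely for exposition: an example oracle that, for each labelled example it draws, reveals only the at most k attributes the learner has chosen to focus on, together with the label. All names here (RFAOracle, observe, the toy target function) are hypothetical, and attributes are assumed Boolean for simplicity.

# Illustrative sketch of a k-RFA example oracle (hypothetical code, not from the paper).
# For each example, the learner commits to at most k of the n attribute indices;
# the oracle draws a labelled example from the distribution and reveals only
# the focused attributes together with the label.

import random
from typing import Callable, Dict, Sequence, Tuple


class RFAOracle:
    def __init__(self, n: int, k: int,
                 target: Callable[[Sequence[int]], int],
                 draw_example: Callable[[], Sequence[int]]):
        self.n = n                        # number of attributes
        self.k = k                        # focus-of-attention bound
        self.target = target              # hidden concept f: {0,1}^n -> {0,1}
        self.draw_example = draw_example  # sampler for the example distribution

    def observe(self, focus: Sequence[int]) -> Tuple[Dict[int, int], int]:
        """Draw a fresh labelled example and reveal only the focused attributes."""
        if len(set(focus)) > self.k:
            raise ValueError(f"at most {self.k} attributes may be observed per example")
        x = self.draw_example()           # the full example is never shown to the learner
        visible = {i: x[i] for i in focus}
        return visible, self.target(x)    # (partial view, label)


if __name__ == "__main__":
    # Toy instance: n = 5 Boolean attributes under the uniform distribution,
    # hidden concept f(x) = x_0 OR x_2, and a 2-RFA learner that focuses on
    # attributes {0, 2} for every example it requests.
    n, k = 5, 2
    oracle = RFAOracle(n, k,
                       target=lambda x: x[0] | x[2],
                       draw_example=lambda: [random.randint(0, 1) for _ in range(n)])
    for _ in range(3):
        view, label = oracle.observe([0, 2])
        print(view, label)

In the k-wRFA variant described in the abstract, the restriction would instead be on the number of bits returned per observation (k bits), with no constraint on how those bits are computed from the full example.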