Automatic Image Annotation and Retrieval Using Weighted Feature Selection

LEI WANG (leiwang@utdallas.edu)
LATIFUR KHAN (lkhan@utdallas.edu)
Department of Computer Science, University of Texas at Dallas, Richardson, Texas 75083

Abstract. The development of technology generates huge amounts of non-textual information, such as images. An efficient image annotation and retrieval system is therefore highly desired. Clustering algorithms make it possible to represent the visual features of images with finite symbols. Based on this, many statistical models have been published that analyze the correspondence between visual features and words and discover hidden semantics. These models improve the annotation and retrieval of large image databases. However, image data usually have a large number of dimensions. Traditional clustering algorithms assign equal weights to these dimensions and become confounded when dealing with them. In this paper, we propose a weighted feature selection algorithm as a solution to this problem. For a given cluster, we determine relevant features based on histogram analysis and assign greater weight to relevant features than to less relevant ones. We have implemented several different models that link visual tokens with keywords based on the clustering results of the K-means algorithm, both with weighted feature selection and without feature selection, and evaluated performance in terms of precision, recall, and correspondence accuracy on a benchmark dataset. The results show that weighted feature selection outperforms traditional clustering for automatic image annotation and retrieval.

Keywords: automatic image annotation, subspace clustering algorithm

1. Introduction

Images are a major source of content on the Internet. The development of technology such as digital cameras and mobile telephones equipped with such devices generates huge amounts of non-textual information, such as images. An efficient image retrieval system is desirable where, given a large database, we need, for example, to find the images that contain tigers, or, given an unseen image, to find keywords that best describe its content [Duygulu02]. Hence, these techniques raise the possibility of several interesting applications, such as:

· Automatic image annotation/description: in many cases collections of images are kept for various uses. Newspapers may want to retrieve images from a so-called morgue [Markkula2000]. Image retrieval might also be an important part of intelligence gathering and surveillance. With regard to a data bank of images, annotation through the use of keywords is often an uncertain proposition. Technical advances in the field of automatic image annotation would therefore be most welcome.

· A further application of image retrieval tools involves art history and public museums. In the case of the latter, images are often published on the web. While the entire collection cannot practically be posted, it would be useful to allow patrons or students to access an archive in search of particular images [Forst2000]. A method of organizing the collection that supports browsing would thus be attractive and would make more sense to visitors. Aggregating images that look similar and are similarly annotated would be a good start.

· Commercial image collections could offer an attractive service if searching the collection could be made less difficult and expensive. Illustrating text with images could be made much easier, and could even take the form of auto-illustration if reasonable results could be cheaply obtained.

Content-based image retrieval (CBIR) computes relevance based on the visual similarity of low-level image features such as color histograms, textures, shapes, and spatial layout. However, the problem is that visual similarity is not semantic similarity: there is a gap between low-level visual features and semantic meanings. This so-called semantic gap is the major problem that needs to be solved by most CBIR approaches. For example, a CBIR system may answer a query request for a ‘red ball’ with an image of a ‘red rose’.
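To make the low-level matching that CBIR relies on concrete, the following minimal sketch compares two images purely by their global color histograms. It is an illustrative assumption on our part, not the system described in this paper: the file names, the use of NumPy and Pillow, and the choice of histogram intersection as the similarity measure are all hypothetical. A ‘red ball’ and a ‘red rose’ can score as highly similar under such a measure, which is exactly the semantic gap discussed above.

```python
# Minimal sketch: compare two images by global RGB color histograms.
# Library choices (Pillow, NumPy), file names, and the intersection
# measure are illustrative assumptions, not the paper's method.
import numpy as np
from PIL import Image

def color_histogram(path, bins=8):
    """Return a normalized joint RGB histogram with bins**3 entries."""
    pixels = np.asarray(Image.open(path).convert("RGB")).reshape(-1, 3)
    hist, _ = np.histogramdd(pixels, bins=(bins, bins, bins),
                             range=((0, 256),) * 3)
    hist = hist.ravel()
    return hist / hist.sum()

def histogram_intersection(h1, h2):
    """Similarity in [0, 1]; 1.0 means identical color distributions."""
    return float(np.minimum(h1, h2).sum())

if __name__ == "__main__":
    # Hypothetical image files; visually similar colors can yield a high
    # score even when the semantic content is entirely different.
    sim = histogram_intersection(color_histogram("red_ball.jpg"),
                                 color_histogram("red_rose.jpg"))
    print(f"color-histogram similarity: {sim:.3f}")
```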
If we provide annotation of images with keywords, then a typical way to publish an image data repository is to create a keyword-based query interface to an image database. Images are retrieved if they contain (some combination of) the keywords specified by the user. Our goal is to query images not only in their entirety but also by the individual objects that appear in them. For example, the user can specify a query using only the “tiger” object, and the result set will consist of tiger objects. On the other hand, a user can specify a query using the “tiger” object while excluding the “river” object; in that case, the result set will include images that contain a “tiger” object but not a “river” object. It is important to note that in this paper, object and visual token are used interchangeably. To achieve all these goals at this fine granularity, there are several technical challenges:

1. Segment images into meaningful visual segments/tokens.
2. Determine the correlation between associated keywords and visual tokens.

With regard to the first problem, we rely on normalized cuts, which segment images into a number of visual tokens [Shi97]. Each visual token is represented by a vector of colors, textures, shapes, etc. A visual token therefore denotes a segmented region or object, described by a set of low-level features such as color, texture, and shape. In Figure 1, the first column corresponds to images and the second column shows the segmented images, i.e., sets of visual tokens.

Figure 1. Demonstration of correspondence between image objects and their keywords.

With regard to the second problem, there are several tasks one could attack. First, one could attempt to predict annotations of entire images using all the information present, which is the annotation task. Next, one might att
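To make the object-level query semantics described above concrete, here is a small hypothetical sketch. It assumes each image has already been annotated with a set of object keywords (e.g., by models such as those discussed in this paper); retrieval with an included and an excluded keyword then reduces to simple set tests. The data structure, function name, and example data are our own illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch of object-level retrieval with include/exclude
# keywords, assuming annotation has already produced a keyword set per
# image (the mapping below is made-up example data).
from typing import Dict, Iterable, List, Set

def query_images(annotations: Dict[str, Set[str]],
                 include: Iterable[str],
                 exclude: Iterable[str] = ()) -> List[str]:
    """Return ids of images whose object keywords contain every keyword
    in `include` and none of the keywords in `exclude`."""
    include, exclude = set(include), set(exclude)
    return [image_id for image_id, keywords in annotations.items()
            if include <= keywords and not (exclude & keywords)]

if __name__ == "__main__":
    annotations = {
        "img001": {"tiger", "grass"},
        "img002": {"tiger", "river"},
        "img003": {"river", "sky"},
    }
    # Images containing a "tiger" object but no "river" object.
    print(query_images(annotations, include=["tiger"], exclude=["river"]))
    # -> ['img001']
```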