翻译 - 三七文档

PIDALION:ImplementationissuesofaJava-basedMultimediaSearchEngineoverthewebDimitrisE.Charilas,OuraniaI.MarkakiNationalTechnicalUniversityofAthens,DepartmentofElectricalandComputerEngineering,Keywords:multimediacontent,queries,content-basedretrieval,multimediacrawler,metadata,imagehistogram,hierarchicalpresentationAbstract-Fuelledbytherapidexpansionofbroadbandconnectivityandincreasinginterestinonlinemultimedia-richapplications,thegrowthofdigitalmultimediacontenthasskyrocketed.Amongothers,thisgrowthiscompoundingtheneedformoreeffectivemethodsforsearchingmultimediainformation.Theautomatedwebsearchenginesthatarecurrentlyusedrelyonlyontextdescriptionsandasaresultprovidematchesofpoorqualityincaseofmultimediacontent.Theservicesofamultimediasearchenginearethereforeapossibilitythattheinternetusersstilllack.Thus,thescopeofthispaperistopresentanimplementationapproachforapersonalizedweb-basedmultimediasearchengineintheJavaprogramminglanguage.Thisapproachcombinesthecharacteristicsofthecurrentsearchenginesaswellasnewinnovativefeatureswhichguaranteeatthesametimethesystem’squickresponseandbettersearchresults.Inthispaperthereadercanfindananalyticalpresentationofallthecomponentsrequiredtoformamultimediasearchengine,aswellasindicationsonhowtoimplementkeyalgorithmsandfunctions.1.INTRODUCTIONThewebcreatesnewchallengesforinformationretrieval.Theamountofinformationonthewebisgrowingrapidlyandsoisthenumberofnewusersinexperiencedintheartofwebresearch.Itisestimatedthat1-2Exa-Bytes(millionsofTera-Bytes)ofnewinformationarecreatedeachyearovertheWeb.Thishugeamountofinformationisanticipatedtogrowbyafactorof10inthefollowingtwoyears.Automatedsearchenginesthatrelyonkeywordmatchingusuallyreturntoomanylowqualitymatches.Thesituationisworseasfarasmultimediacontentisconcerned.Themostpopularsearchengine,Google[1],reliesonlyonkeywordstosearchforimagesanddoesnotcontainanyinformationonsemanticcontent.Content-basedimageretrievalsystems(CBIR)trytosolvethisproblem.ManyCBIRsystemshavebeenrecentlyproposedandimplementedintheliterature.ExamplesincludetheQBICsystem[2],wherecolourinformationisexploited,thePicToSeeksystem[3],whichcombinescolourandshapeinvariantfeaturestoperformimageretrievalandVirage[4]thatallowstheuserstomanuallyregulatetheimportanceoftheextracteddescriptorsaccordingtotheirownperception.Fuzzyorganizationofthedescriptorsisproposedin[5]forincreasingtheretrievalprecisionatacertainrecallvalue,while3Dsearchingisdiscussedin[6].Applicationsofcontent-basedretrievalsystemsareexaminedin[7],whilein[8]asystemregardingmusicaccessisproposed.Personalizedretrievalisexaminedintheworkpresentedin[9].Lastbutnotleast,Marvelthelatestandmoreintelligentcontent-basedsearchengine,developedbytheIBMresearchcentre,USAin2004[10],triestoincreasetheretrievalprecisionaccuracybyincorporatingsemanticannotationinthemediavolumes.However,alltheadoptedapproacheshavestaticandlocalaccessonlytothesystem’sdatabaseandthuscannotretrievecontentfromtheweb[11].Furthermore,theaforementionedworksfocusonthealgorithmsforefficientcontent-basedretrievalandnotonthepracticalissuesregardingtheimplementationofalargescalemultimediasearchengineovertheWeb.Sofar,severaldifferenttechniquesformakingdistributedmultimediacontentsearchablehavebeenproposed.In[12]thereisinformationonthetechniquesofcheckingtheoutgoinglinks,analyzingthereferringpage,miningfortextualinformationinthemediafileandutilizingmetadatausingtheDublinCoremetadatamodelortheMPEG-7standard.Thispaperfocusesondescribingamultimediasearchenginethatcombinesfeaturesfromexistingsearchenginesandenhancestheirfunctionalitiesthroughinnovativealgorithmsandmechanisms.Ourgoalisnotonlytodescribethesystem’sarchitectureandinterconnectivity,butalsotoexplainhowthealgorithmscanbeimplementedinJavacode.Theproposedsystem,namedPIDALION,runsonWindowsenvironment,whiletheJavaServerPages(JSP)andJavaServletstechnologiesareadoptedtoensurethesystem’sinteroperabilityanddynamicbehaviour.Thesystem’sdatabaserunsonSQLServer2000.Oneofthekeyfeaturesoftheproposedsearchengineistheprovisionoffullypersonalizedretrievalservices:usersofPIDALIONmaysharetheirpersonalcontenteitherwithallwebusersorwithintheframeofgroups,aswellasmaintainapersonalprofile,wheretheirpreferencesarestored.Personalizedretrievalcanbeachievedthroughthecreationofsocialgroupsandtheuseofdynamicrelevancefeedbackmechanisms,whichtailorthesystem’sperformancetothecurrentuser’spreferences.Thispaperisorganizedasfollows:Section2presentsthesystem’sarchitecture,explainingbrieflytheroleofeachmaincomponent.Sections3to7presentthefunctionality,architectureandkeyfeatures-innovationsofeachcomponent.Keyalgorithmsaredepictedintheformofpseudo-code.Finally,inSection8theissuescoveredinthispaperaresummarizedandfutureexpansionsareproposed.2.SYSTEMOVERVIEWTheplatformdescribedinthispaperconsistsofthefollowingsubsystems:•Themultimediacrawlingsubsystem,whoseroleistoindexmultimediacontentandhandletheupdatingofheindexingprocess•Themultimediametadatasubsystem,whichextractsmetadatafrommultimediacontent,accordingtotheMPEG-7descriptorsachievinginthiswa

翻译

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

资源环境与城市规划专业自荐书

3标段烟囱、水塔及循环水系统施工组织设计

二次结构施工方案518

锤击混凝土预制桩、钢桩施工记录表GD2301012

我国环境民事纠纷行政处理机制的法律性质解析及构建

江山网络游戏简要策划案

混凝土预制构件和商品混凝土生产企业资质管理规定

船舶名称管理办法

9000管理手册-Q

云南省高等学校教学改革研究项目

相关文档

相关搜索