Abstract

Microblogging has grown rapidly in just a few years and has become an important channel for spreading information. Sina Weibo alone has long since passed 300 million registered users, who publish more than 100 million posts per day, which makes microblogs an ideal place to study public opinion at scale. With this much information, filtering is a natural requirement, so this work starts from that basic need: finding hot topics among hundreds of millions of internet users. Doing so helps supervise and regulate online behavior and clean up the online environment. More importantly, it yields useful information for commercial applications (mining user interests), communication studies (network topology and hot-topic tracking), and some areas of sociological research. It therefore meets a real demand and has practical significance for social harmony, a healthy online public-opinion ecosystem, and national development.

The Weibo public opinion analysis system mines and analyzes hot events. This thesis first introduces the theoretical basis of public opinion analysis and several related algorithms, then records the complete development of the system from scratch, and finally uses the software to identify hot events on Weibo.

Keywords: Weibo, public opinion analysis, vector space model, k-means clustering algorithm

Contents

Chapter 1 Introduction
  1.1 Research background and significance
  1.2 Related research in China and abroad
  1.3 Theoretical significance and practical value of the thesis
Chapter 2 Related technologies
  2.1 Development platform overview
    2.1.1 Introduction to Eclipse
    2.1.2 Sina Weibo API
  2.2 Introduction to OAuth 2.0
  2.3 Vector space model
  2.4 Text clustering algorithms
  2.5 Database solution
    2.5.1 Introduction to MySQL
    2.5.2 MySQL background
    2.5.3 MySQL system features
Chapter 3 System design
  3.1 Hot-topic analysis strategy
  3.2 Overall system design
  3.3 Detailed module design
    3.3.1 Data acquisition module
    3.3.2 Data analysis module
  3.4 Database design
Chapter 4 System implementation
  4.1 Data acquisition module implementation
    4.1.1 Core code
    4.1.2 Related sequence diagrams
  4.2 Data processing module implementation
    4.2.1 Core code
    4.2.2 Related sequence diagrams
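The keywords name the vector space model and k-means clustering as the core techniques, but this preview does not include the thesis's own code. The following is only a minimal sketch of how the two could fit together, not the author's implementation. It assumes posts have already been segmented into tokens, uses TF-IDF weights with cosine similarity, and seeds the clusters deterministically. The function names (`tfidf_vectors`, `cosine`, `mean_vector`, `kmeans`) and the sample posts are hypothetical, and Python is used purely for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Map tokenized posts to unit-length sparse TF-IDF vectors (term -> weight)."""
    n = len(docs)
    # Document frequency: in how many posts each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vec = {t: (c / len(doc)) * (math.log(n / df[t]) + 1.0) for t, c in tf.items()}
        norm = math.sqrt(sum(w * w for w in vec.values()))
        vectors.append({t: w / norm for t, w in vec.items()})
    return vectors


def cosine(u, v):
    """Cosine similarity of two unit-length sparse vectors (their dot product)."""
    if len(u) > len(v):
        u, v = v, u
    return sum(w * v.get(t, 0.0) for t, w in u.items())


def mean_vector(members):
    """Sum sparse vectors and rescale to unit length (direction of the mean)."""
    total = Counter()
    for vec in members:
        for t, w in vec.items():
            total[t] += w
    norm = math.sqrt(sum(w * w for w in total.values()))
    return {t: w / norm for t, w in total.items()}


def kmeans(vectors, k, max_iter=20):
    """Cluster unit vectors by cosine similarity; returns one label per vector."""
    # Assumed seeding: start from the first post, then repeatedly add the
    # post that is least similar to the seeds chosen so far.
    centroids = [vectors[0]]
    while len(centroids) < k:
        far = min(vectors, key=lambda v: max(cosine(v, c) for c in centroids))
        centroids.append(far)
    labels = None
    for _ in range(max_iter):
        new_labels = [max(range(k), key=lambda j: cosine(v, centroids[j]))
                      for v in vectors]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:  # keep the old centroid if a cluster empties
                centroids[j] = mean_vector(members)
    return labels


# Hypothetical, already-tokenized posts about two different events.
posts = [
    ["earthquake", "rescue", "sichuan"],
    ["earthquake", "sichuan", "relief"],
    ["rescue", "earthquake", "donation"],
    ["football", "final", "goal"],
    ["football", "goal", "champion"],
    ["final", "champion", "football"],
]
labels = kmeans(tfidf_vectors(posts), k=2)
topic_sizes = Counter(labels)  # larger clusters suggest hotter topics
```

In this sketch the size of each cluster stands in for topic heat. A real system would also have to segment Chinese text, choose k, and weigh signals such as reposts and comments, none of which the preview specifies.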
Document title: Weibo Public Opinion Hot-Topic Analysis System
Link: https://www.777doc.com/doc-6346323.html