面向智能服务机器人的非特定人交互口令语音识别系统研究

硕士研究生学位论文题目:面向智能服务机器人的非特定人交互口令语音识别系统研究姓名:杨雪松学号:A0617218院系:软件与微电子学院专业:软件工程研究方向:嵌入式系统工程导师:刘宏教授二零零九年五月版权声明任何收存和保管本论文各种版本的单位和个人，未经本论文作者同意，不得将本论文转借他人，亦不得随意复制、抄录、拍照或以任何方式传播。否则，引起有碍作者著作权之问题，将可能承担法律责任。北京大学硕士学位论文面向智能服务机器人的非特定人交互口令语音识别系统研究----------------------------------------------------------------------------------------------------------------------------------------------------------深圳研究生院信息工程学院智能机器人开放实验室2009年5月—I—摘要随着科学的发展和计算机的普及，人们对于计算机的交流方式提出了更高的要求，这就促进了语音识别技术的发展，并使之成为语音处理领域中的一个重要研究方向。语音识别技术就是让机器能够通过识别和理解过程把语音信号转变为相应的文本或命令的高技术。近二十年来，语音识别技术取得显著进步，已经开始从实验室走向市场，在智能家居与智能机器人产品中得到应用。本文致力于对语音识别技术的基础研究，采用基于模板匹配方法的动态时间规整（DTW）算法实现了面向智能服务机器人的非特定人交互口令识别系统。在实验室的环境下，成功实现了基于该平台的人机问答交互系统。人机交互口令识别系统，不同于PC机上的系统，这需要考虑实用化中的人机安全性，一旦遇到易于混淆的单词，就可能带来意想不到的后果。因此，从大量预设词汇表中筛选适应性、鲁棒性较强的单词对于系统来说至关重要。本文通过实验数据分析和手工筛选，挑选出了30个较合适的单词作为词表，同时再次验证系统的可靠程度。在非特定人方面，本文提出了一种采用总体均值和方差信息相结合的方法，来解决不同说话人的语音模板训练问题，较以往方法的正确识别率有所提升。同时，该方法还用于处理易混淆的单词误识别问题，根据样本集训练得到的方差信息，来判定接受还是拒绝识别结果，如果拒绝识别结果，则选择次小匹配距离对应的模板作为修正识别结果，进而提高系统的单词正确识别率，其中应用了实用多元统计分析中的最大似然估计方法。此外，本文借助于微软提供的WindowsMultimediaAPI开发了在线实时语音采集程序，实现了人机在线实时交互。关键词：语音识别，孤立词，非特定人，人机口令交互北京大学硕士学位论文面向智能服务机器人的非特定人交互口令语音识别系统研究----------------------------------------------------------------------------------------------------------------------------------------------------------深圳研究生院信息工程学院智能机器人开放实验室2009年5月—II—AbstractWiththedevelopmentofScienceandthepopularizationofComputers,peoplehavemoredemandsonthemeansofhuman-computerinteraction,whichcanpromotethedevelopmentofspeechrecognitiontechnologyandmakeitasignificantresearchdirectioninthefieldofspeechsignalprocessing.SpeechrecognitiontechnologyisaHi-TechTechnology,whichcantranslatespeechsignalsintocorrespondingtextsorcommandsintheprocessofrecognitionandunderstanding.Overthepasttwodecades,speechrecognitiontechnologyhasmaderemarkableprogressandstartedtoapplyinsmarthomeandintelligentrobots.Thisthesisisdedicatedtothebasicresearchofspeechrecognitiontechnology,andhasimplementedanIntelligentServiceRobotorientedSpeaker-Independentvoicecommandsrecognitionsystem.ItadaptsthetemplatematchingmethodbasedonDynamicTimeWarping(DTW)algorithm.HRIvoicecommandsrecognitionsystemneedstotakeaccountofhuman-robotsecurityinpractice.It’sdifferentfromPCplatform,andcanbringinunexpectedconsequencesonceencounteringeasilyconfusedwords.Therefore,itiscrucialforthesystemtoselectrobustwordsinthepre-definedwordlist.Thisthesishasselected30robustwordsasthewordlistafterseveralexperimentaldataanalysisandmanualselection,anditprovesreliableintheexperiments.AsforSpeaker-Independent,anewmethodisproposedtotrainthespeechtemplates,whichisbasedonthefusionofoverallmeanandvarianceinformation,andithasimprovedthewordcorrectrecognitionrate.Meanwhile,thismethodcanbeusedtodealwitherrorrecognitionresultsofeasilyconfusedwordsaswell.Accordingtothevarianceinformation,whethertoacceptorrejecttherecognitionresultistobedetermined,andifrejected,thepatterncorrespondingtothesecondminimummatchingdistanceisrecognizedasmodifiedcorrectresult.Afterseveralexperiments,itproveseffectivetoimprove北京大学硕士学位论文面向智能服务机器人的非特定人交互口令语音识别系统研究----------------------------------------------------------------------------------------------------------------------------------------------------------深圳研究生院信息工程学院智能机器人开放实验室2009年5月—III—thecorrectrecognitionrate.Inall,MaximumLikelihoodEstimation(MLE)methodismainlyappliedinthethesis.Besidesthat,anonlinereal-timevoicecapturingprogramhasbeendevelopedusingMicrosoftWindowsMultimediaAPI,andinthisthesis,itisappliedintheIntelligentServiceRobot,whichisdevelopedbyOpenLabonIntelligentRobotics.Keywords:SpeechRecognition,IsolatedWords,SpeakerIndependent,Human-RobotvoicecommandsInteraction北京大学硕士学位论文面向智能服务机器人的非特定人交互口令语音识别系统研究----------------------------------------------------------------------------------------------------------------------------------------------------------深圳研究生院信息工程学院智能机器人开放实验室2009年5月—IV—目录摘要.................................................................................................................................IAbstract...........................................................................................................................II目录..............................................................................................................................IV图标索引...........................................................................................................................I第一章概述..................................................................................................................11.1意义....................................................................................................................11.2发展和现状........................................................................................................31.2.1国外现状..................................................................................................31.2.2国内现状..................................................................................................51.3研究和应用中面临的难题..................................

面向智能服务机器人的非特定人交互口令语音识别系统研究

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

房地产法平时作业一答案

批发及零售企业质量保证模式标准(doc26)(1)

担保法律实务_130627

21 成本曲线

关于大陆法系处分原则的发展趋势及对我国民事诉讼的借

第三章发展心理学研究方法-PowerPointPres

快餐店连锁店品牌设计公司晏钧设计作品

第四讲企业战略管理咨询

压力容器现场组焊工艺标(校对版)

仓库安全检查表

相关文档

相关搜索