
Papers Related to the Bayesian Algorithm

Thesis title: Improving an Anti-Spam System Based on Bayesian Classification

Abstract

E-mail has become a fast and economical means of modern communication and has greatly facilitated the way people communicate and exchange information. The rise of spam, however, disrupts normal e-mail traffic, consumes transmission bandwidth, and poses a serious threat to system security. Research on anti-spam techniques has therefore become a worldwide topic of great practical significance.

At present, spam is countered mainly through anti-spam legislation and mail filtering technology. A variety of mail filtering techniques have appeared in succession; the common ones include black/white lists, content-based analysis, and rule-based methods. Content-based analysis is gradually being adopted in mail filtering and has become a focus of current research. The typical content-based method is the spam filtering model based on the Bayesian algorithm.

This thesis systematically analyzes and studies the characteristics of Chinese spam and, drawing on Bayesian (Bayes) theory, constructs a spam filtering model based on Bayesian classification. For feature extraction, it uses mutual information values. For classification, it introduces a classification method suited to this work and adopts a representation better suited to Bayesian computation. The author tested the proposed methods extensively on a standard data set of Chinese spam and regular mail samples collected and maintained by the China Education and Research Network (CERNET). The accuracy and misjudgment rate reached 95.8% and 5.3% respectively. The results show that a spam filtering system based on the Bayesian algorithm is effective at blocking spam.

Key words: e-mail, spam, mail filtering, Bayesian theory

Contents

Chapter 1 Introduction
1.1 Foreword
1.2 Definition of spam and its [...]
1.2.1 Definition of spam
1.2.2 [...] of spam
1.3 Current state of anti-spam work in China and abroad
1.4 Research goals and content of the thesis
Chapter 2 Spam technology
2.1 Brief introduction to how e-mail works
2.1.1 Overview of e-mail
2.1.2 E-mail format
2.1.3 Mail delivery process
2.1.4 Related protocols
2.2 Non-technical anti-spam measures
2.3 Common anti-spam techniques
2.3.1 Client-side anti-spam filtering
2.3.2 Server-side anti-spam filtering
Chapter 3 Spam classification vectors and feature vectors
3.1 Overview of spam classification vectors
3.2 Definition of spam classification vectors and feature vectors
3.3 Classification methods
3.3.1 Text representation methods
3.3.2 Keyword selection
3.3.3 Feature extraction
3.3.
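
The abstract names two building blocks: feature extraction by mutual information and classification by a Bayesian model. The excerpt does not give the thesis's actual formulas, so the following is only a minimal sketch under stated assumptions: mail is already segmented into tokens, mutual information is computed between term presence and class over documents, and the classifier is a standard multinomial naive Bayes with add-one smoothing. The function names (`select_features`, `train`, `classify`) and the toy data are hypothetical and do not come from the thesis.

```python
import math
from collections import Counter


def select_features(docs, labels, k):
    """Keep the k terms with the highest mutual information with the class.

    Assumption: MI is taken between term presence/absence and class label,
    estimated from document counts. The thesis may use another variant.
    """
    n = len(docs)
    classes = sorted(set(labels))
    n_c = Counter(labels)                    # documents per class
    df = Counter()                           # documents containing each term
    df_c = {c: Counter() for c in classes}   # same, per class
    for tokens, c in zip(docs, labels):
        for t in sorted(set(tokens)):
            df[t] += 1
            df_c[c][t] += 1

    def mi(t):
        total = 0.0
        for c in classes:
            for present in (True, False):
                joint = df_c[c][t] if present else n_c[c] - df_c[c][t]
                if joint == 0:               # 0 * log(0) is treated as 0
                    continue
                p_joint = joint / n
                p_t = (df[t] if present else n - df[t]) / n
                p_c = n_c[c] / n
                total += p_joint * math.log(p_joint / (p_t * p_c))
        return total

    return set(sorted(df, key=mi, reverse=True)[:k])


def train(docs, labels, vocab):
    """Multinomial naive Bayes with add-one (Laplace) smoothing."""
    classes = sorted(set(labels))
    prior = {c: math.log(labels.count(c) / len(labels)) for c in classes}
    counts = {c: Counter() for c in classes}
    for tokens, c in zip(docs, labels):
        counts[c].update(t for t in tokens if t in vocab)
    loglik = {}
    for c in classes:
        total = sum(counts[c].values()) + len(vocab)
        loglik[c] = {t: math.log((counts[c][t] + 1) / total) for t in vocab}
    return prior, loglik


def classify(tokens, model):
    """Return the class with the highest posterior log score."""
    prior, loglik = model
    scores = {
        c: prior[c] + sum(loglik[c][t] for t in tokens if t in loglik[c])
        for c in prior
    }
    return max(scores, key=scores.get)


# Toy, made-up training data: pre-tokenized messages and their labels.
docs = [
    ["free", "prize", "click", "now"],
    ["free", "offer", "click", "now"],
    ["free", "prize", "now"],
    ["meeting", "report", "now"],
    ["project", "report", "meeting"],
    ["meeting", "schedule", "now"],
]
labels = ["spam", "spam", "spam", "ham", "ham", "ham"]

vocab = select_features(docs, labels, k=5)
model = train(docs, labels, vocab)
print(classify(["free", "click", "now"], model))
print(classify(["meeting", "report"], model))
```

In this toy data the word "now" appears in both classes, so it scores low on mutual information and is dropped from the vocabulary, while words that occur in only one class are kept. Real Chinese mail would first need word segmentation, which this sketch leaves out.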