A LDA Model and Chi-Square Test based Approach for Topic Mining of Cyber Violence
Affiliation:

1. School of Public Basic, Bengbu Medical College, Bengbu, Anhui 233030, China; 2. School of Health Management, Bengbu Medical College, Bengbu, Anhui 233030, China

Fund Project:

Anhui Provincial Philosophy and Social Science Project (AHSKQ2019D070).

Abstract:

Objective: Social networks contain many violent topics, and identifying them is important for precise intervention in and control of online public opinion. Current research on cyber violence concentrates on computing users' negative sentiment and on identifying violent users. It lacks fine-grained analysis of how cyber violence is organized, and therefore cannot accurately identify the carriers to which cyber violence attaches in a complex network environment. Method: Based on an analysis of how cyber violence concentrates within topics, a cyber violence topic identification method based on the LDA model and the chi-square test is proposed. The method first uses the LDA model to identify the topics in an online corpus and assigns texts to topics with a similarity measure. It then applies the chi-square test to screen the violence features in the topic texts. Finally, it computes the violence value within each topic from a sentiment lexicon and judges whether each topic is violent according to its violence density. Result/Conclusion: Experiments on a real online corpus show that the method achieves a mean F value of 80.64% for violent topic identification, outperforming the comparison methods and identifying cyber violence topics effectively.
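The last two steps of the method (chi-square screening of violence features, then a per-topic violence density) can be illustrated with a minimal sketch. It is not the authors' implementation, and the abstract does not give their formulas. The sketch assumes that the LDA and similarity steps have already produced `topics`, a hypothetical mapping from topic name to tokenized texts; that a small labeled seed set is available for the chi-square test; and that violence density means the summed lexicon weight of the selected features divided by the topic's token count. The toy data, the lexicon weights, and the 0.3 density threshold are all made up for illustration.

```python
def chi_square(n11, n10, n01, n00):
    """Chi-square statistic of a 2x2 term/label contingency table.

    n11: violent texts containing the term
    n10: non-violent texts containing the term
    n01: violent texts without the term
    n00: non-violent texts without the term
    """
    n = n11 + n10 + n01 + n00
    denom = (n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00)
    if denom == 0:
        return 0.0
    return n * (n11 * n00 - n10 * n01) ** 2 / denom


def select_violent_terms(docs, labels, critical=3.84):
    """Keep terms that are over-represented in violent texts.

    critical=3.84 is the chi-square critical value for 1 degree of
    freedom at the 0.05 level; using it here is an assumption.
    """
    vocab = {term for doc in docs for term in doc}
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    selected = {}
    for term in vocab:
        n11 = sum(1 for d, y in zip(docs, labels) if y and term in d)
        n10 = sum(1 for d, y in zip(docs, labels) if not y and term in d)
        n01, n00 = n_pos - n11, n_neg - n10
        score = chi_square(n11, n10, n01, n00)
        # The statistic is symmetric, so also require a positive association.
        if score >= critical and n11 * n00 > n10 * n01:
            selected[term] = score
    return selected


def violence_density(topic_docs, features, lexicon):
    """Assumed definition: lexicon weight of selected features per token."""
    tokens = [t for doc in topic_docs for t in doc]
    if not tokens:
        return 0.0
    value = sum(lexicon.get(t, 0.0) for t in tokens if t in features)
    return value / len(tokens)


# Hypothetical labeled seed set (1 = violent, 0 = not violent).
seed_docs = [
    ["you", "idiot", "trash"],
    ["idiot", "loser", "trash"],
    ["shut", "up", "idiot", "trash"],
    ["idiot", "go", "away"],
    ["nice", "weather", "today"],
    ["great", "match", "today"],
    ["go", "team", "great"],
    ["you", "played", "great"],
]
seed_labels = [1, 1, 1, 1, 0, 0, 0, 0]

# Hypothetical output of the LDA and similarity steps (not shown here).
topics = {
    "topic_a": [["idiot", "trash", "referee"], ["trash", "call"]],
    "topic_b": [["great", "goal", "today"], ["idiot", "joke", "funny", "haha"]],
}

# Hypothetical sentiment lexicon with violence weights.
lexicon = {"idiot": 1.0, "trash": 0.8, "loser": 0.6}
DENSITY_THRESHOLD = 0.3  # illustrative cut-off, not taken from the paper

features = select_violent_terms(seed_docs, seed_labels)
density = {name: violence_density(docs, features, lexicon)
           for name, docs in topics.items()}
flagged = {name: d >= DENSITY_THRESHOLD for name, d in density.items()}
print(sorted(features), flagged)
```

On this toy data the screening keeps only the terms that occur mostly in the violent seed texts, and `topic_a` is flagged while `topic_b` is not. In practice the topic assignment would come from an LDA library, and the threshold would need to be tuned on labeled topics.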

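Cite this article

XIE Jing, LIU Yuwen. A LDA Model and Chi-Square Test based Approach for Topic Mining of Cyber Violence[J]. Journal of Xichang University (Natural Science Edition), 2022, 36(4): 103-109.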

History
  • Received: 2022-06-07
  • Last revised: 2022-06-07
  • Accepted: 2022-07-18
  • Published online: 2023-01-13