基于 Hadoop 平台的主题概念股票挖掘系统应用研究
DOI:
作者:
作者单位:

作者简介:

通讯作者:

基金项目:

安徽省高校自然科学研究重点项目(KJ2019A1049)?2020 年安徽省级精品线下开放课程?WEB 程序设计( JSP)? (2020kfkc130)?


Application of Thematic Concept Stock Detecting System Based on
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    针对目前资本市场上快速挖掘某种主题概念股票的需求ꎬ提出了一种新思路ꎬ该思路以上市公司的核心题材、主营收 入和资本运作 3 项数据为基础ꎬ进行主题概念相关指数的分析和计算ꎬ最终以此指数作为标准推荐主题概念相关股票ꎬ并开 发了一套数据抓取程序和 Web 应用程序ꎮ 数据抓取程序利用定时组件 Quartz 从各大财经网站抓取全体上市公司已公开的各 类基本信息ꎬ存入分布式文件系统 HDFS 中ꎻWeb 应用程序接收用户输入的查询关键字组合ꎬ系统利用抓取的数据集从公司收 入、投资和核心概念 3 方面分析和计算出公司与用户需要查询的关键字组合的相关指数ꎬ最后汇总为总相关指数ꎬ总相关指 数越高的公司ꎬ其相关度越高ꎬ相关度越高的公司越有可能就是用户想要查找的相关主题概念公司ꎮ 通过这 3 方面的结合ꎬ在 公司的过去和未来ꎬ在定性和定量等多个方面都进行了相关度的挖掘ꎬ从而计算出来的相关性将更加可靠、准确ꎮ

    Abstract:

    In response to the demand of promptly detecting thematic concept stocksin the current capital marketꎬ this paper proposes a new approach which analyzes and calculates the correlated index of the theme concept based on the data of the core conceptꎬ main business income and capital operation of the listed companies. The outcome of the calculation provides a standard for selecting thematic concept stocks. This paper also develops a data capture program for catching various basic information from all listed companies and saving the data in the distributed file system HDFS with timing components Quartzꎬ and a Web application program which receives the query keyword combination from users and figures out correla ̄ ted index of the query keyword combination between the demand users of and that of companies in terms of the company′s incomeꎬ investment and core concept. At lastꎬ the program aggregatesall related index into the total correlation index. The higher the total correlation index isꎬ the higher the correlation degree isꎬ the more likely the company is to be the related thematic concept company that users want to search for.Through the combination of the three aspectsꎬ correlative degree is determined by the past and future of the company through qualitative and quantitative assessmentsꎬ therefore the calcula ̄ tion is more accurate and reliable.

    参考文献
    相似文献
    引证文献
引用本文

丁 俊.基于 Hadoop 平台的主题概念股票挖掘系统应用研究[J].西昌学院学报(自然科学版),2021,35(2):82-88.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2021-07-27