首页 > 其他 > 详细

A Realtime BigData Dashboad

时间:2020-07-11 21:32:20      阅读:39      评论:0      收藏:0      [点我收藏+]

Purpose

Reference and mock the course practice, http://dblab.xmu.edu.cn/post/8274/

Its business flow is as below picture.

Inspired by this project,  and reuse the technologies to implement an feature:

(1) scrawl information from the URL

https://en.wikipedia.org/wiki/COVID-19_pandemic

(2) leverage spark streaming to count words times(words stats).

(3) push word stats to browser to display the imediate info to end user.

 

Take an exercise for bigdata realtime streaming app for improving competence.

https://github.com/fanqingsong/Realtime-BigData-Dashboard

A demo for realtime dashboard, based on bigdata technology and popular realtime comunication web technology.

 

技术分享图片

 

 

 

 

Architect

  • scrawler.py ---> kafka
  • kafka ---> wordCounter.py
  • wordCounter.py ---> kafka
  • kafka ---> app.py
  • app.py ---> browser

 

Technology

bigdata techs:

  • kafka -- tranfer all data between components
  • spark streaming -- data statistics
  • scrawler -- get raw data from url.

 

web techs:

  • flask -- python web framework
  • socket.io -- frontend/backend data exchange tunnel
  • vue -- popular frontend JS framework

Install & Run

reference:

https://github.com/fanqingsong/Realtime-BigData-Dashboard

 

Demo

run srawler app, then go to browser to see the imediate statistics.

python3 backend/scrawler.py

技术分享图片

 

A Realtime BigData Dashboad

原文:https://www.cnblogs.com/lightsong/p/13285226.html

(0)
(0)
   
举报
评论 一句话评论(0
关于我们 - 联系我们 - 留言反馈 - 联系我们:wmxa8@hotmail.com
© 2014 bubuko.com 版权所有
打开技术之扣,分享程序人生!