经过mapreduce清洗后的数据如下(截取部分)
60.10.5.65 20130530220737 source/plugin/wmff_wxyun/img/wmff_zk.css
60.10.5.65 20130530220738 source/plugin/study_nge/js/HoverLi.js
60.10.5.65 20130530220741 home.php?mod=misc&ac=sendmail&rand=1369922680
60.10.5.65 20130530220742 favicon.ico
60.10.5.65 20130530220742 forum.php
60.10.5.65 20130530220742 source/plugin/wmff_wxyun/img/wx_jqr.gif
60.10.5.65 20130530220742 template/newdefault/style/t5/bgimg.jpg
60.10.5.65 20130530220744 data/attachment/common/cf/104854ejrssrbbfsfv6cn5.jpg
60.10.5.65 20130530220744 source/plugin/wmff_wxyun/img/wx_jqr.gif
60.10.5.65 20130530220744 template/newdefault/style/t5/bgimg.jpg
60.10.5.65 20130530220744 template/newdefault/style/t5/nv.png
60.10.5.65 20130530220744 template/newdefault/style/t5/nv_a.png
60.10.5.65 20130530220745 data/attachment/common/cf/104950hio3tgww8tgpqtcz.jpg
60.10.5.65 20130530220745 data/attachment/common/cf/105041vvvi7pgez0w1mvxv.jpg
60.10.5.65 20130530220745 data/attachment/common/cf/180036e72352fq3reerq13.jpg
60.10.5.65 20130530220745 home.php?mod=misc&ac=sendmail&rand=1369922680
60.10.5.65 20130530220745 source/plugin/study_nge/images/list10.gif
60.10.5.65 20130530220746 source/plugin/study_nge/images/listbg.gif
60.10.5.65 20130530220747 api/connect/like.php
3、使用hive对清洗后的数据进行多维分析
(1)统计每日的pv(浏览量)
hive> create table hmbbs_pv
> as select count(1) as pv from hmbbs_table;
(2)统计每日的register(注册用户数)
hive> create table hmbbs_register
> as select count(1) as register
> from hmbbs_table
> where instr(urllog,'member.php?mod=register') > 0;
本文来自电脑杂谈,转载请注明本文网址:
http://www.pc-fly.com/a/jisuanjixue/article-28332-3.html
就算原料里有虫卵但后续的磨粉再到高温烤熟密封包装都会让虫卵坏死
桃子
就你们都不知道被轮奸多少次了