Last modified: 2014-06-25 20:45:55 UTC
Report browser stats from hadoop data. Our current browser stats come from squid reports: http://stats.wikimedia.org/wikimedia/squids/SquidReportClients.html We should work towards replacing these reports with pageview data from hadoop + ua parser. Erik Z. suggested that we can "pipe data from new input stream into old reports via csv files"
>Erik Z. suggested that we can "pipe data from new input stream into old reports via csv files" There is no need to do this and new reports can be generated from hadoop directly. This work is contingent on the work that we are currently doing in hadoop/kafka to productionize the setup and migrate to latest cloudera release.
Here's the original email with the request: https://lists.wikimedia.org/mailman/private/analytics-internal/2014-June/001664.html