Last modified: 2014-04-16 20:18:26 UTC
Created attachment 14954 [details] Load graph for analytics 1018 while running the queries (2014-03-28) When running three hive queries on a week's worth of mobile request data, load on the Hadoop cluster nodes rises above the number of CPUs. Should we limit the resources that Hive/Hadoop can take on those machines?
Prioritization and scheduling of this bug is tracked on Mingle card https://wikimedia.mingle.thoughtworks.com/projects/analytics/cards/cards/1503
Created attachment 14955 [details] Load graph for analytics 1019 while running the queries (2014-03-28)
Created attachment 14956 [details] SQL for first query
Created attachment 14957 [details] SQL for second query
Created attachment 14958 [details] SQL for third query
Agree -- perhaps we need to tune the number of slots for mappers/reducers?