请问 hadoop 异构集群 优化配置
小小小超人 2014-01-20 03:07:18 A型机:内存7.5G,CPU 2个
B型机:内存1.7G, CPU1个
执行语句:
insert overwrite table archive_seller_by_geo_per_day partition(partDate, shard) select sellerTokenID, partDate archiveDate, countryCode, country, countryState, countryCity, count(*) pv, count(distinct ip) uv, partDate, shard from page_visit where partDate >= ${THREE_DAY[1]} and partDate <= ${THREE_DAY[3]} group by sellerTokenID, countryCode, country, countryState, countryCity, partDate, shard having sellerTokenID>0;
page_visit 表中接近四千万数据将近800秒
slave为3A+3B一起跑与3A单独跑速度差不多。
请问在异构集群下可以做哪些优化和配置?