hahoop下kmeans聚类成功过后 需要解析出详细的聚类内容 应该用什么方法?

qq_35364722 2017-05-24 06:03:55
这是聚类后的输出结果文件
-rw-r--r-- 3 Administrator supergroup 194 2017-05-24 05:59 /user/newjobskill/_policy
drwxr-xr-x - Administrator supergroup 0 2017-05-24 05:59 /user/newjobskill/clusteredPoints
drwxr-xr-x - Administrator supergroup 0 2017-05-24 05:59 /user/newjobskill/clusters-0
drwxr-xr-x - Administrator supergroup 0 2017-05-24 05:59 /user/newjobskill/clusters-1-final


这是查看clusteredPoints这个文件的结果
Key: 2: Value: wt: 1.0 distance: 0.2436398126020346 vec: [{"27":0.149},{"41":0.149},{"76":0.149},{"84":0.149},{"87":0.149},{"91":0.149},{"107":0.418},{"115":0.149},{"118":0.149},{"123":0.149}]
Key: 2: Value: wt: 1.0 distance: 0.20707680037322207 vec: [{"38":0.152},{"52":0.152},{"54":0.152},{"64":0.152},{"93":0.152},{"107":0.428},{"117":0.152},{"123":0.152}]
Key: 2: Value: wt: 1.0 distance: 0.23111287221912813 vec: [{"40":0.143},{"41":0.143},{"48":0.143},{"52":0.143},{"53":0.143},{"54":0.143},{"68":0.143},{"76":0.143},{"77":0.143},{"80":0.143},{"107":0.401},{"117":0.143},{"119":0.143},{"123":0.143}]
Key: 2: Value: wt: 1.0 distance: 0.3221210688268743 vec: [{"14":0.131},{"16":0.131},{"19":0.131},{"23":0.131},{"27":0.131},{"32":0.131},{"42":0.131},{"52":0.131},{"54":0.131},{"65":0.131},{"68":0.131},{"76":0.131},{"87":0.131},{"102":0.131},{"105":0.131},{"107":0.369},{"110":0.131},{"117":0.131},{"119":0.131},{"123":0.131},{"124":0.131},{"125":0.131},{"126":0.131}]
Key: 2: Value: wt: 1.0 distance: 0.17536038428224132 vec: [{"27":0.158},{"99":0.158},{"107":0.444},{"123":0.158},{"126":0.158}]
Key: 2: Value: wt: 1.0 distance: 0.25344517894733176 vec: [{"52":0.146},{"53":0.146},{"54":0.146},{"68":0.146},{"71":0.146},{"76":0.146},{"77":0.146},{"84":0.146},{"97":0.146},{"98":0.146},{"107":0.409},{"126":0.146}]
Key: 2: Value: wt: 1.0 distance: 0.35433890904804866 vec: [{"2":0.14},{"38":0.14},{"54":0.14},{"65":0.14},{"71":0.14},{"76":0.14},{"84":0.14},{"89":0.14},{"93":0.14},{"94":0.14},{"98":0.14},{"100":0.14},{"107":0.393},{"110":0.14},{"125":0.14},{"126":0.14}]
Key: 1: Value: wt: 1.0 distance: 0.36439959171819736 vec: [{"9":0.139},{"11":0.139},{"26":0.139},{"34":0.139},{"38":0.139},{"41":0.139},{"54":0.139},{"71":0.139},{"72":0.139},{"76":0.139},{"84":0.139},{"93":0.139},{"97":0.139},{"108":0.389},{"110":0.139},{"124":0.139},{"126":0.139}]
Key: 1: Value: wt: 1.0 distance: 0.22973649835541288 vec: [{"14":0.151},{"16":0.151},{"36":0.151},{"51":0.151},{"68":0.151},{"70":0.151},{"82":0.151},{"108":0.423},{"111":0.151}]
Key: 1: Value: wt: 1.0 distance: 0.21114737678861495 vec: [{"40":0.156},{"41":0.156},{"58":0.156},{"88":0.156},{"108":0.438},{"111":0.156}]
Key: 3: Value: wt: 1.0 distance: 0.1355259414583404 vec: [{"41":0.156},{"43":0.438},{"62":0.156},{"63":0.156},{"88":0.156},{"114":0.156}]
Key: 1: Value: wt: 1.0 distance: 0.22220658880175626 vec: [{"41":0.154},{"63":0.154},{"69":0.154},{"88":0.154},{"96":0.154},{"108":0.433},{"111":0.154}]
Key: 1: Value: wt: 1.0 distance: 0.3929836377324506 vec: [{"27":0.144},{"28":0.144},{"31":0.144},{"56":0.144},{"59":0.144},{"61":0.144},{"66":0.144},{"79":0.144},{"87":0.144},{"108":0.405},{"113":0.144},{"115":0.144},{"119":0.144}]
Key: 1: Value: wt: 1.0 distance: 0.21289260005767163 vec: [{"5":0.16},{"22":0.16},{"108":0.45},{"116":0.16}]
Key: 1: Value: wt: 1.0 distance: 0.12085474423746512 vec: [{"16":0.151},{"25":0.151},{"41":0.151},{"54":0.151},{"68":0.151},{"76":0.151},{"81":0.151},{"108":0.423},{"123":0.151}]
Key: 1: Value: wt: 1.0 distance: 0.12085474423746512 vec: [{"16":0.151},{"25":0.151},{"41":0.151},{"54":0.151},{"68":0.151},{"76":0.151},{"81":0.151},{"108":0.423},{"123":0.151}]
Key: 1: Value: wt: 1.0 distance: 0.12085474423746512 vec: [{"16":0.151},{"25":0.151},{"41":0.151},{"54":0.151},{"68":0.151},{"76":0.151},{"81":0.151},{"108":0.423},{"123":0.151}]
Key: 1: Value: wt: 1.0 distance: 0.12085474423746512 vec: [{"16":0.151},{"25":0.151},{"41":0.151},{"54":0.151},{"68":0.151},{"76":0.151},{"81":0.151},{"108":0.423},{"123":0.151}]
Key: 1: Value: wt: 1.0 distance: 0.14363401814153398 vec: [{"36":0.156},{"68":0.156},{"70":0.156},{"99":0.156},{"108":0.438},{"123":0.156}]
Key: 1: Value: wt: 1.0 distance: 0.1535421530103479 vec: [{"24":0.156},{"51":0.156},{"54":0.156},{"76":0.156},{"108":0.438},{"123":0.156}]
Key: 1: Value: wt: 1.0 distance: 0.1629873475608017 vec: [{"52":0.158},{"99":0.158},{"102":0.158},{"108":0.444},{"123":0.158}]
Key: 2: Value: wt: 1.0 distance: 0.18588691166425686 vec: [{"3":0.154},{"9":0.154},{"10":0.154},{"41":0.154},{"76":0.154},{"107":0.433},{"123":0.154}]
Key: 1: Value: wt: 1.0 distance: 0.16338928595626734 vec: [{"5":0.152},{"24":0.152},{"52":0.152},{"68":0.152},{"99":0.152},{"102":0.152},{"108":0.428},{"123":0.152}]
Key: 1: Value: wt: 1.0 distance: 0.20466059695856897 vec: [{"5":0.147},{"21":0.147},{"27":0.147},{"41":0.147},{"61":0.147},{"63":0.147},{"68":0.147},{"69":0.147},{"108":0.414},{"111":0.147},{"123":0.147}]
Key: 1: Value: wt: 1.0 distance: 0.27366028655933317 vec: [{"14":0.13},{"22":0.13},{"24":0.13},{"36":0.13},{"41":0.13},{"52":0.13},{"60":0.13},{"63":0.13},{"64":0.13},{"68":0.13},{"70":0.13},{"75":0.13},{"76":0.13},{"78":0.13},{"79":0.13},{"80":0.13},{"82":0.13},{"85":0.13},{"88":0.13},{"99":0.13},{"102":0.13},{"108":0.365},{"111":0.13},{"123":0.13}]
Key: 1: Value: wt: 1.0 distance: 0.24700480397126368 vec: [{"0":0.137},{"7":0.137},{"19":0.137},{"24":0.137},{"30":0.137},{"33":0.137},{"35":0.137},{"41":0.137},{"47":0.137},{"52":0.137},{"54":0.137},{"68":0.137},{"76":0.137},{"84":0.137},{"88":0.137},{"108":0.386},{"111":0.137},{"123":0.137}]
Key: 1: Value: wt: 1.0 distance: 0.20439377074569198 vec: [{"1":0.147},{"5":0.147},{"24":0.147},{"36":0.147},{"52":0.147},{"68":0.147},{"70":0.147},{"99":0.147},{"108":0.414},{"116":0.147},{"123":0.147}]
Key: 5: Value: wt: 1.0 distance: 0.20894758954681814 vec: [{"9":0.144},{"11":0.144},{"14":0.144},{"25":0.144},{"41":0.144},{"52":0.144},{"54":0.144},{"76":0.144},{"81":0.144},{"106":0.144},{"109":0.405},{"111":0.144},{"123":0.144}]
Key: 5: Value: wt: 1.0 distance: 0.3072475447188747 vec: [{"38":0.156},{"71":0.156},{"90":0.156},{"93":0.156},{"103":0.156},{"109":0.438}]
Key: 5: Value: wt: 1.0 distance: 0.12008013760053915 vec: [{"41":0.151},{"52":0.151},{"54":0.151},{"68":0.151},{"76":0.151},{"102":0.151},{"105":0.151},{"109":0.423},{"111":0.151}]
Key: 5: Value: wt: 1.0 distance: 0.1379542638331912 vec: [{"14":0.147},{"36":0.147},{"52":0.147},{"54":0.147},{"63":0.147},{"68":0.147},{"76":0.147},{"82":0.147},{"102":0.147},{"109":0.414},{"123":0.147}]
Key: 5: Value: wt: 1.0 distance: 0.11419869503289126 vec: [{"52":0.156},{"68":0.156},{"71":0.156},{"76":0.156},{"109":0.438},{"123":0.156}]
Count: 113
17/05/24 06:01:53 INFO MahoutDriver: Program took 2315 ms (Minutes: 0.03858333333333333)



我需要怎么才能把这样的结果解析成聚类前的那种一条一条的汉字数据?
...全文
308 2 打赏 收藏 转发到动态 举报
写回复
用AI写文章
2 条回复
切换为时间正序
请发表友善的回复…
发表回复
qq_35364722 2017-05-27
  • 打赏
  • 举报
回复
有大神看了吗
qq_35364722 2017-05-24
  • 打赏
  • 举报
回复
求会打大神详细说明一下 感谢

20,811

社区成员

发帖
与我相关
我的任务
社区描述
Hadoop生态大数据交流社区,致力于有Hadoop,hive,Spark,Hbase,Flink,ClickHouse,Kafka,数据仓库,大数据集群运维技术分享和交流等。致力于收集优质的博客
社区管理员
  • 分布式计算/Hadoop社区
  • 涤生大数据
加入社区
  • 近7日
  • 近30日
  • 至今
社区公告
暂无公告

试试用AI创作助手写篇文章吧