intel vtune9 是否没有动态分析功能

chenxu_ustc 2009-03-21 03:23:05
我在学习代码优化
上面介绍vtune有动态分析功能 ,就是展开Hotspot看函数的汇编代码 ,然后可以设定需要分析从哪里开始的几行汇编代码 ,
运行动态分析功能 ,vtune就能够出一个报告,内容有: L1cache miss次数 L2cache miss次数, dataalignment 没对齐的访问次数等

可是我在vtune9下面一直没有找到
amd的codeanalize 有这个功能 ,难道现在只能使用amd cpu来做这部分的代码优化了么

vtune 动态分析的报告样例

Dynamic Analysis Results for the Chosen Hotspot


Decoder Minimum Clocks = 0, ; // Minimum decoding time: 0 clocks
Decoder Average Clocks = 0.7 ; // Average decoding time: 0.7 clock
Decoder Maximum Clocks = 14 ; // Maximum decoding time: 14 clocks

Retirement Minimum Clocks = 0, ; // Minimum retirement time: 0 clocks
Retirement Average Clocks = 6.9 ; // Average retirement time: 6.9 clocks
Retirement Maximum Clocks = 104 ; // Maximum retirement time: 104 clocks

Total Cycles = 20117 (35.88%) ; // Total execution time: 20,117 clocks

Micro-Ops for this instruction = 1 ; // Decoding occurs in one micro-operation

The instruction had to wait (0,0.1,2) cycles for its sources to be ready.

Warnings: 3*decode_slow:0 ; // No decoder conflicts

Dynamic Penalty: DC_rd_miss
The operand of this load instruction was not in the data cache. The
instruction stalls while the processor loads the specified address location
from L2 cache or the main memory.
Occurrences = 1 ; // This happened once.

Dynamic Penalty: DC_misalign
The instruction stalls because it accessed data that was split across two
data-cache lines.
Occurrences = 2000 ; // This happened 2,000 times.

Dynamic Penalty: L2data_rd_miss
The operand of this load instruction was not in L2 cache. The instruction
stalls while the processor loads the specified address location from the
main memory.
Occurrences = 1 ; // This happened once.

Dynamic Penalty: No_BTB_info
The BTB (Branch Target Buffer) does not contain information about this branch.
The branch was predicted using the static branch prediction algorithm.
Occurrences = 1 ; // This happened once.


...全文
91 2 打赏 收藏 转发到动态 举报
写回复
用AI写文章
2 条回复
切换为时间正序
请发表友善的回复…
发表回复
chenxu_ustc 2009-03-23
  • 打赏
  • 举报
回复
谢谢, 就是Tuning Assistance.
Tuning Assistance 上没有我要的cache等信息 是由于我用的queck ananlysis ,这个里面的事件信息是默认的
, 选择complete analysis里可以添加监视的events ,就可以了.
intel_zhenyuwang 2009-03-23
  • 打赏
  • 举报
回复
根据你的结果信息,我想你可能使用的是Intel? 的Tuning Assistance (在Source View上选中一段代码)- 有什么问题吗?

不理解你的问题-
“可是我在vtune9下面一直没有找到
amd的codeanalize 有这个功能 ,难道现在只能使用amd cpu来做这部分的代码优化了么 ”

请仔细阅读Release notes of Intel(R) VTune(TM) Performance Analyzer 9.1

566

社区成员

发帖
与我相关
我的任务
社区描述
英特尔® 边缘计算,聚焦于边缘计算、AI、IoT等领域,为开发者提供丰富的开发资源、创新技术、解决方案与行业活动。
社区管理员
  • 英特尔技术社区
  • shere_lin
加入社区
  • 近7日
  • 近30日
  • 至今
社区公告
暂无公告

试试用AI创作助手写篇文章吧