Guest User

carrizoinatructionlatency

a guest
Oct 2nd, 2015
897
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
  1. ------[ Versions ]------
  2.  
  3. Program Version : AIDA64 v5.30.3500
  4. BenchDLL Version: 4.1.643-x64
  5.  
  6. ------[ CPU Info ]------
  7.  
  8. CPU Type : QuadCore AMD FX-8800P, 1400 MHz (14 x 100)
  9. CPU Alias : Carrizo
  10. CPU Platform : Socket FP4
  11. CPU Stepping : CZ-A1
  12. Instruction Set : x86, x86-64, MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, SSE4A, XOP, AVX, AVX2, FMA, FMA4, AES
  13. CPUID Manufacturer: AuthenticAMD
  14. CPUID CPU Name : AMD FX-8800P Radeon R7, 12 Compute Cores 4C+8G
  15. CPUID Revision : 00660F01h
  16. Platform ID : DFh (Socket FP4)
  17. HTT / CMP Units : 0 / 4
  18. Max. NUMA Node : 0
  19.  
  20. Tjmax Temperature : 0 Celsius
  21. HTC Temperature Limit: 100 Celsius
  22. CPU TDP : 15.0 W
  23.  
  24. Inst 0 X86 : NOP L: [no true dep.] T: 0.18ns= 0.38c
  25. Inst 1 X86 : 0x66 NOP L: [no true dep.] T: 0.18ns= 0.38c
  26. Inst 2 X86 : 2x 0x66 NOP L: [no true dep.] T: 0.18ns= 0.38c
  27. Inst 3 X86 : 3x 0x66 NOP L: [no true dep.] T: 0.18ns= 0.38c
  28. Inst 4 X86 : 4x 0x66 NOP L: [no true dep.] T: 10.22ns= 21.42c
  29. Inst 5 X86 : 5x 0x66 NOP L: [no true dep.] T: 10.38ns= 21.75c
  30. Inst 6 X86 : 6x 0x66 NOP L: [no true dep.] T: 10.58ns= 22.17c
  31. Inst 7 X86 : 7x 0x66 NOP L: [no true dep.] T: 10.74ns= 22.50c
  32. Inst 8 X86 : 8x 0x66 NOP L: [no true dep.] T: 15.23ns= 31.92c
  33. Inst 9 X86 : 9x 0x66 NOP L: [no true dep.] T: 15.39ns= 32.25c
  34. Inst 10 X86 : 10x 0x66 NOP L: [no true dep.] T: 15.59ns= 32.67c
  35. Inst 11 X86 : 11x 0x66 NOP L: [no true dep.] T: 15.74ns= 33.00c
  36. Inst 12 X86 : 12x 0x66 NOP L: [no true dep.] T: 20.24ns= 42.42c
  37. Inst 13 X86 : 13x 0x66 NOP L: [no true dep.] T: 20.40ns= 42.75c
  38. Inst 14 X86 : 14x 0x66 NOP L: [no true dep.] T: 20.59ns= 43.17c
  39. Inst 15 SSE2 : PAUSE L: [no true dep.] T: 2.23ns= 4.67c
  40. Inst 16 X86 : MOV r8, imm8 L: 0.72ns= 1.5c T: 0.27ns= 0.56c
  41. Inst 17 X86 : MOV r16, imm16 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  42. Inst 18 X86 : MOV r32, imm32 L: 0.19ns= 0.4c T: 0.19ns= 0.39c
  43. Inst 19 AMD64 : MOV r64, imm64 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  44. Inst 20 X86 : MOV r8, r8 L: 0.72ns= 1.5c T: 0.27ns= 0.56c
  45. Inst 21 X86 : MOV r16, r16 L: 0.72ns= 1.5c T: 0.20ns= 0.43c
  46. Inst 22 X86 : MOV r32, r32 L: 0.52ns= 1.1c T: 0.18ns= 0.37c
  47. Inst 23 AMD64 : MOV r64, r64 L: 0.52ns= 1.1c T: 0.18ns= 0.38c
  48. Inst 24 X86 : MOV r8, [m8] L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  49. Inst 25 X86 : MOV r16, [m16] L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  50. Inst 26 X86 : MOV r32, [m32] L: 2.86ns= 6.0c T: 0.36ns= 0.75c
  51. Inst 27 AMD64 : MOV r64, [m64] L: 2.86ns= 6.0c T: 0.36ns= 0.75c
  52. Inst 28 X86 : MOV [m8], r8 L: [memory dep.] T: 0.72ns= 1.50c
  53. Inst 29 X86 : MOV [m16], r16 L: [memory dep.] T: 0.72ns= 1.50c
  54. Inst 30 X86 : MOV [m32], r32 L: [memory dep.] T: 0.72ns= 1.50c
  55. Inst 31 X86 : MOV [m32 + 8], r32 L: [memory dep.] T: 0.72ns= 1.50c
  56. Inst 32 AMD64 : MOV [m64], r64 L: [memory dep.] T: 0.72ns= 1.50c
  57. Inst 33 AMD64 : MOV [m64 + 16], r64 L: [memory dep.] T: 0.72ns= 1.50c
  58. Inst 34 X86 : MOV r8,[m8]+MOV [m8],r8 L: 3.58ns= 7.5c T: 1.59ns= 3.33c
  59. Inst 35 X86 : MOV r16,[m16]+MOV [m16],r16 L: 34.27ns= 71.8c T: 7.32ns= 15.33c
  60. Inst 36 X86 : MOV r32,[m32]+MOV [m32],r32 L: 33.00ns= 69.2c T: 13.48ns= 28.25c
  61. Inst 37 AMD64 : MOV r64,[m64]+MOV [m64],r64 L: 33.00ns= 69.2c T: 7.08ns= 14.83c
  62. Inst 38 SSE2 : MOVNTI [m32], r32 L: [memory dep.] T: 2.17ns= 2.17c
  63. Inst 39 AMD64 : MOVNTI [m64], r64 L: [memory dep.] T: 2.17ns= 2.17c
  64. Inst 40 CMOV : CMOVNZ r16, r16 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  65. Inst 41 CMOV : CMOVNZ r32, r32 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  66. Inst 42 AMD64 : CMOVNZ r64, r64 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  67. Inst 43 X86 : MOVSX r16, r8 L: 0.72ns= 1.5c T: 0.20ns= 0.41c
  68. Inst 44 X86 : MOVSX r32, r8 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  69. Inst 45 AMD64 : MOVSX r64, r8 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  70. Inst 46 X86 : MOVSX r32, r16 L: 0.72ns= 1.5c T: 0.20ns= 0.43c
  71. Inst 47 AMD64 : MOVSX r64, r16 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  72. Inst 48 AMD64 : MOVSXD r64, r32 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  73. Inst 49 X86 : MOVZX r16, r8 L: 0.72ns= 1.5c T: 0.20ns= 0.41c
  74. Inst 50 X86 : MOVZX r32, r8 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  75. Inst 51 AMD64 : MOVZX r64, r8 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  76. Inst 52 X86 : MOVZX r32, r16 L: 0.72ns= 1.5c T: 0.20ns= 0.43c
  77. Inst 53 AMD64 : MOVZX r64, r16 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  78. Inst 54 X86 : XCHG r8, r8 L: 1.43ns= 3.0c T: 0.20ns= 0.42c
  79. Inst 55 X86 : XCHG r16, r16 L: 1.43ns= 3.0c T: 0.37ns= 0.77c
  80. Inst 56 X86 : XCHG r32, r32 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  81. Inst 57 AMD64 : XCHG r64, r64 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  82. Inst 58 X86 : XCHG r1_8, r2_8 L: 1.43ns= 3.0c T: 0.35ns= 0.73c
  83. Inst 59 X86 : XCHG r1_16, r2_16 L: 0.72ns= 1.5c T: 0.33ns= 0.69c
  84. Inst 60 X86 : XCHG r1_32, r2_32 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  85. Inst 61 AMD64 : XCHG r1_64, r2_64 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  86. Inst 62 X86 : XCHG r8, [m8] L: 30.77ns= 64.5c T: 30.77ns= 64.50c
  87. Inst 63 X86 : XCHG r16, [m16] L: 30.77ns= 64.5c T: 30.77ns= 64.50c
  88. Inst 64 X86 : XCHG r32, [m32] L: 30.77ns= 64.5c T: 30.77ns= 64.50c
  89. Inst 65 AMD64 : XCHG r64, [m64] L: 30.77ns= 64.5c T: 30.77ns= 64.50c
  90. Inst 66 X86 : ADD r32, 0x04000 L: 0.72ns= 1.5c T: 0.24ns= 0.50c
  91. Inst 67 X86 : ADD r32, 0x08000 L: 0.72ns= 1.5c T: 0.24ns= 0.50c
  92. Inst 68 X86 : ADD r32, 0x10000 L: 0.72ns= 1.5c T: 0.24ns= 0.50c
  93. Inst 69 X86 : ADD r32, 0x20000 L: 0.72ns= 1.5c T: 0.24ns= 0.50c
  94. Inst 70 X86 : ADD r8, r8 L: 0.72ns= 1.5c T: 0.27ns= 0.56c
  95. Inst 71 X86 : ADD r16, r16 L: 0.72ns= 1.5c T: 0.20ns= 0.43c
  96. Inst 72 X86 : ADD r32, r32 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  97. Inst 73 AMD64 : ADD r64, r64 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  98. Inst 74 X86 : ADD r8, [m8] L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  99. Inst 75 X86 : ADD r16, [m16] L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  100. Inst 76 X86 : ADD r32, [m32] L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  101. Inst 77 AMD64 : ADD r64, [m64] L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  102. Inst 78 X86 : ADD [m8], r8 L: 5.80ns= 12.2c T: 0.48ns= 1.00c
  103. Inst 79 X86 : ADD [m16], r16 L: 5.80ns= 12.2c T: 0.72ns= 1.50c
  104. Inst 80 X86 : ADD [m32], r32 L: 5.33ns= 11.2c T: 0.44ns= 0.92c
  105. Inst 81 X86 : ADD [m32 + 8], r32 L: 5.33ns= 11.2c T: 0.72ns= 1.50c
  106. Inst 82 AMD64 : ADD [m64], r64 L: 5.33ns= 11.2c T: 0.72ns= 1.50c
  107. Inst 83 AMD64 : ADD [m64 + 16], r64 L: 5.33ns= 11.2c T: 0.72ns= 1.50c
  108. Inst 84 X86 : LOCK ADD [m8], r8 L: 25.76ns= 54.0c T: 25.76ns= 54.00c
  109. Inst 85 X86 : LOCK ADD [m16], r16 L: 25.76ns= 54.0c T: 25.76ns= 54.00c
  110. Inst 86 X86 : LOCK ADD [m32], r32 L: 25.77ns= 54.0c T: 25.77ns= 54.00c
  111. Inst 87 X86 : LOCK ADD [m32 + 8], r32 L: 25.76ns= 54.0c T: 25.76ns= 54.00c
  112. Inst 88 AMD64 : LOCK ADD [m64], r64 L: 25.76ns= 54.0c T: 25.76ns= 54.00c
  113. Inst 89 AMD64 : LOCK ADD [m64 + 16], r64 L: 25.76ns= 54.0c T: 25.80ns= 54.08c
  114. Inst 90 X86 : ADD r8, imm8 L: 0.72ns= 1.5c T: 0.28ns= 0.59c
  115. Inst 91 X86 : ADD r16, imm8 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  116. Inst 92 X86 : ADD r32, imm8 L: 0.72ns= 1.5c T: 0.20ns= 0.43c
  117. Inst 93 AMD64 : ADD r64, imm8 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  118. Inst 94 X86 : ADD r16, imm16 L: 0.72ns= 1.5c T: 0.20ns= 0.42c
  119. Inst 95 X86 : ADD r32, imm32 L: 0.72ns= 1.5c T: 0.24ns= 0.50c
  120. Inst 96 AMD64 : ADD r64, imm32 L: 0.72ns= 1.5c T: 0.27ns= 0.56c
  121. Inst 97 X86 : ADD [m8], imm8 L: 5.80ns= 12.2c T: 0.72ns= 1.50c
  122. Inst 98 X86 : ADD [m16], imm8 L: 5.80ns= 12.2c T: 0.76ns= 1.58c
  123. Inst 99 X86 : ADD [m32], imm8 L: 5.33ns= 11.2c T: 0.48ns= 1.00c
  124. Inst 100 AMD64 : ADD [m64], imm8 L: 5.25ns= 11.0c T: 0.72ns= 1.50c
  125. Inst 101 X86 : ADD [m16], imm16 L: 5.80ns= 12.2c T: 0.76ns= 1.58c
  126. Inst 102 X86 : ADD [m32], imm32 L: 5.33ns= 11.2c T: 0.44ns= 0.92c
  127. Inst 103 AMD64 : ADD [m64], imm32 L: 5.33ns= 11.2c T: 0.72ns= 1.50c
  128. Inst 104 X86 : ADD al, imm8 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  129. Inst 105 X86 : ADD ax, imm16 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  130. Inst 106 X86 : ADD eax, imm32 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  131. Inst 107 AMD64 : ADD rax, imm32 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  132. Inst 108 X86 : SUB r8, r8 L: 0.72ns= 1.5c T: 0.27ns= 0.56c
  133. Inst 109 X86 : SUB r16, r16 L: 0.72ns= 1.5c T: 0.20ns= 0.43c
  134. Inst 110 X86 : SUB r32, r32 L: 0.19ns= 0.4c T: 0.18ns= 0.38c
  135. Inst 111 AMD64 : SUB r64, r64 L: 0.19ns= 0.4c T: 0.18ns= 0.38c
  136. Inst 112 X86 : SUB r1_8, r2_8 L: 0.72ns= 1.5c T: 0.40ns= 0.84c
  137. Inst 113 X86 : SUB r1_16, r2_16 L: 0.72ns= 1.5c T: 0.24ns= 0.50c
  138. Inst 114 X86 : SUB r1_32, r2_32 L: 0.72ns= 1.5c T: 0.23ns= 0.49c
  139. Inst 115 AMD64 : SUB r1_64, r2_64 L: 0.72ns= 1.5c T: 0.19ns= 0.39c
  140. Inst 116 X86 : ADC r8, r8 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  141. Inst 117 X86 : ADC r16, r16 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  142. Inst 118 X86 : ADC r32, r32 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  143. Inst 119 AMD64 : ADC r64, r64 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  144. Inst 120 X86 : SBB r8, r8 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  145. Inst 121 X86 : SBB r16, r16 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  146. Inst 122 X86 : SBB r32, r32 L: 0.64ns= 1.3c T: 0.64ns= 1.33c
  147. Inst 123 AMD64 : SBB r64, r64 L: 0.64ns= 1.3c T: 0.64ns= 1.33c
  148. Inst 124 X86 : SBB r1_8, r2_8 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  149. Inst 125 X86 : SBB r1_16, r2_16 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  150. Inst 126 X86 : SBB r1_32, r2_32 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  151. Inst 127 AMD64 : SBB r1_64, r2_64 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  152. Inst 128 X86 : CMP r8, r8 L: [no true dep.] T: 0.18ns= 0.38c
  153. Inst 129 X86 : CMP r16, r16 L: [no true dep.] T: 0.18ns= 0.39c
  154. Inst 130 X86 : CMP r32, r32 L: [no true dep.] T: 0.18ns= 0.39c
  155. Inst 131 AMD64 : CMP r64, r64 L: [no true dep.] T: 0.18ns= 0.39c
  156. Inst 132 X86 : CMP r1_8, r2_8 L: [no true dep.] T: 0.18ns= 0.39c
  157. Inst 133 X86 : CMP r1_16, r2_16 L: [no true dep.] T: 0.19ns= 0.40c
  158. Inst 134 X86 : CMP r1_32, r2_32 L: [no true dep.] T: 0.18ns= 0.38c
  159. Inst 135 AMD64 : CMP r1_64, r2_64 L: [no true dep.] T: 0.18ns= 0.39c
  160. Inst 136 X86 : AND r8, r8 L: 0.72ns= 1.5c T: 0.27ns= 0.56c
  161. Inst 137 X86 : AND r16, r16 L: 0.72ns= 1.5c T: 0.20ns= 0.43c
  162. Inst 138 X86 : AND r32, r32 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  163. Inst 139 AMD64 : AND r64, r64 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  164. Inst 140 X86 : AND r1_8, r2_8 L: 0.72ns= 1.5c T: 0.40ns= 0.84c
  165. Inst 141 X86 : AND r1_16, r2_16 L: 0.72ns= 1.5c T: 0.24ns= 0.50c
  166. Inst 142 X86 : AND r1_32, r2_32 L: 0.72ns= 1.5c T: 0.23ns= 0.49c
  167. Inst 143 AMD64 : AND r1_64, r2_64 L: 0.72ns= 1.5c T: 0.19ns= 0.39c
  168. Inst 144 X86 : OR r8, r8 L: 0.72ns= 1.5c T: 0.27ns= 0.56c
  169. Inst 145 X86 : OR r16, r16 L: 0.72ns= 1.5c T: 0.20ns= 0.43c
  170. Inst 146 X86 : OR r32, r32 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  171. Inst 147 AMD64 : OR r64, r64 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  172. Inst 148 X86 : OR r1_8, r2_8 L: 0.72ns= 1.5c T: 0.40ns= 0.84c
  173. Inst 149 X86 : OR r1_16, r2_16 L: 0.72ns= 1.5c T: 0.24ns= 0.50c
  174. Inst 150 X86 : OR r1_32, r2_32 L: 0.72ns= 1.5c T: 0.23ns= 0.49c
  175. Inst 151 AMD64 : OR r1_64, r2_64 L: 0.72ns= 1.5c T: 0.19ns= 0.39c
  176. Inst 152 X86 : XOR r8, r8 L: 0.72ns= 1.5c T: 0.27ns= 0.57c
  177. Inst 153 X86 : XOR r16, r16 L: 0.72ns= 1.5c T: 0.20ns= 0.43c
  178. Inst 154 X86 : XOR r32, r32 L: 0.19ns= 0.4c T: 0.18ns= 0.38c
  179. Inst 155 AMD64 : XOR r64, r64 L: 0.19ns= 0.4c T: 0.18ns= 0.39c
  180. Inst 156 X86 : XOR r1_8, r2_8 L: 0.72ns= 1.5c T: 0.40ns= 0.84c
  181. Inst 157 X86 : XOR r1_16, r2_16 L: 0.72ns= 1.5c T: 0.24ns= 0.50c
  182. Inst 158 X86 : XOR r1_32, r2_32 L: 0.72ns= 1.5c T: 0.23ns= 0.49c
  183. Inst 159 AMD64 : XOR r1_64, r2_64 L: 0.72ns= 1.5c T: 0.19ns= 0.39c
  184. Inst 160 X86 : NEG r8 L: 0.72ns= 1.5c T: 0.26ns= 0.55c
  185. Inst 161 X86 : NEG r16 L: 0.72ns= 1.5c T: 0.20ns= 0.43c
  186. Inst 162 X86 : NEG r32 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  187. Inst 163 AMD64 : NEG r64 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  188. Inst 164 X86 : NOT r8 L: 0.72ns= 1.5c T: 0.27ns= 0.56c
  189. Inst 165 X86 : NOT r16 L: 0.72ns= 1.5c T: 0.20ns= 0.43c
  190. Inst 166 X86 : NOT r32 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  191. Inst 167 AMD64 : NOT r64 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  192. Inst 168 X86 : TEST r8, r8 L: [no true dep.] T: 0.18ns= 0.38c
  193. Inst 169 X86 : TEST r16, r16 L: [no true dep.] T: 0.18ns= 0.38c
  194. Inst 170 X86 : TEST r32, r32 L: [no true dep.] T: 0.18ns= 0.39c
  195. Inst 171 AMD64 : TEST r64, r64 L: [no true dep.] T: 0.18ns= 0.38c
  196. Inst 172 X86 : TEST r1_8, r2_8 L: [no true dep.] T: 0.18ns= 0.38c
  197. Inst 173 X86 : TEST r1_16, r2_16 L: [no true dep.] T: 0.19ns= 0.39c
  198. Inst 174 X86 : TEST r1_32, r2_32 L: [no true dep.] T: 0.18ns= 0.38c
  199. Inst 175 AMD64 : TEST r1_64, r2_64 L: [no true dep.] T: 0.19ns= 0.39c
  200. Inst 176 X86 : BT r16, r16 L: [no true dep.] T: 0.36ns= 0.75c
  201. Inst 177 X86 : BT r32, r32 L: [no true dep.] T: 0.36ns= 0.75c
  202. Inst 178 AMD64 : BT r64, r64 L: [no true dep.] T: 0.36ns= 0.75c
  203. Inst 179 X86 : BT r16, imm8 L: [no true dep.] T: 0.36ns= 0.75c
  204. Inst 180 X86 : BT r32, imm8 L: [no true dep.] T: 0.36ns= 0.75c
  205. Inst 181 AMD64 : BT r64, imm8 L: [no true dep.] T: 0.36ns= 0.75c
  206. Inst 182 X86 : BTC r16, r16 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  207. Inst 183 X86 : BTC r32, r32 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  208. Inst 184 AMD64 : BTC r64, r64 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  209. Inst 185 X86 : BTC r16, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  210. Inst 186 X86 : BTC r32, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  211. Inst 187 AMD64 : BTC r64, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  212. Inst 188 X86 : BTR r16, r16 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  213. Inst 189 X86 : BTR r32, r32 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  214. Inst 190 AMD64 : BTR r64, r64 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  215. Inst 191 X86 : BTR r16, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  216. Inst 192 X86 : BTR r32, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  217. Inst 193 AMD64 : BTR r64, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  218. Inst 194 X86 : BTS r16, r16 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  219. Inst 195 X86 : BTS r32, r32 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  220. Inst 196 AMD64 : BTS r64, r64 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  221. Inst 197 X86 : BTS r16, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  222. Inst 198 X86 : BTS r32, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  223. Inst 199 AMD64 : BTS r64, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  224. Inst 200 X86 : SETC r8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  225. Inst 201 X86 : INC r8 L: 0.72ns= 1.5c T: 0.27ns= 0.56c
  226. Inst 202 X86 : INC r16 L: 0.72ns= 1.5c T: 0.20ns= 0.43c
  227. Inst 203 X86 : INC r32 L: 0.72ns= 1.5c T: 0.19ns= 0.41c
  228. Inst 204 AMD64 : INC r64 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  229. Inst 205 X86 : LEA r16, [r16+r16] L: 1.43ns= 3.0c T: 0.33ns= 0.70c
  230. Inst 206 X86 : LEA r32, [r32+r32] L: 0.72ns= 1.5c T: 0.21ns= 0.43c
  231. Inst 207 AMD64 : LEA r64, [r64+r64] L: 0.72ns= 1.5c T: 0.20ns= 0.41c
  232. Inst 208 X86 : LEA r16, [r+r+disp8] L: 1.43ns= 3.0c T: 0.37ns= 0.78c
  233. Inst 209 X86 : LEA r32, [r+r+disp8] L: 0.72ns= 1.5c T: 0.39ns= 0.81c
  234. Inst 210 AMD64 : LEA r64, [r+r+disp8] L: 0.72ns= 1.5c T: 0.37ns= 0.77c
  235. Inst 211 X86 : LEA r16, [r+r*8] L: 1.43ns= 3.0c T: 0.37ns= 0.77c
  236. Inst 212 X86 : LEA r32, [r+r*8] L: 0.72ns= 1.5c T: 0.37ns= 0.78c
  237. Inst 213 AMD64 : LEA r64, [r+r*8] L: 0.72ns= 1.5c T: 0.36ns= 0.76c
  238. Inst 214 X86 : LEA r16, [r+r*8+disp8] L: 1.43ns= 3.0c T: 0.37ns= 0.78c
  239. Inst 215 X86 : LEA r32, [r+r*8+disp8] L: 0.72ns= 1.5c T: 0.39ns= 0.81c
  240. Inst 216 AMD64 : LEA r64, [r+r*8+disp8] L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  241. Inst 217 X86 : SHL r8, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  242. Inst 218 X86 : SHL r16, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  243. Inst 219 X86 : SHL r32, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  244. Inst 220 AMD64 : SHL r64, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  245. Inst 221 X86 : SHL r8, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  246. Inst 222 X86 : SHL r16, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  247. Inst 223 X86 : SHL r32, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  248. Inst 224 AMD64 : SHL r64, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  249. Inst 225 X86 : SHL r8, cl L: 0.72ns= 1.5c T: 0.35ns= 0.74c
  250. Inst 226 X86 : SHL r16, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  251. Inst 227 X86 : SHL r32, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  252. Inst 228 AMD64 : SHL r64, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  253. Inst 229 X86 : SHR r8, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  254. Inst 230 X86 : SHR r16, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  255. Inst 231 X86 : SHR r32, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  256. Inst 232 AMD64 : SHR r64, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  257. Inst 233 X86 : SHR r8, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  258. Inst 234 X86 : SHR r16, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  259. Inst 235 X86 : SHR r32, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  260. Inst 236 AMD64 : SHR r64, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  261. Inst 237 X86 : SHR r8, cl L: 0.72ns= 1.5c T: 0.35ns= 0.74c
  262. Inst 238 X86 : SHR r16, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  263. Inst 239 X86 : SHR r32, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  264. Inst 240 AMD64 : SHR r64, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  265. Inst 241 X86 : SAR r8, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  266. Inst 242 X86 : SAR r16, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  267. Inst 243 X86 : SAR r32, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  268. Inst 244 AMD64 : SAR r64, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  269. Inst 245 X86 : SAR r8, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  270. Inst 246 X86 : SAR r16, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  271. Inst 247 X86 : SAR r32, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  272. Inst 248 AMD64 : SAR r64, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  273. Inst 249 X86 : SAR r8, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  274. Inst 250 X86 : SAR r16, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  275. Inst 251 X86 : SAR r32, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  276. Inst 252 AMD64 : SAR r64, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  277. Inst 253 X86 : SHLD r16, r16, imm8 L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  278. Inst 254 X86 : SHLD r32, r32, imm8 L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  279. Inst 255 AMD64 : SHLD r64, r64, imm8 L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  280. Inst 256 X86 : SHLD r16, r16, cl L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  281. Inst 257 X86 : SHLD r32, r32, cl L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  282. Inst 258 AMD64 : SHLD r64, r64, cl L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  283. Inst 259 X86 : SHRD r16, r16, imm8 L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  284. Inst 260 X86 : SHRD r32, r32, imm8 L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  285. Inst 261 AMD64 : SHRD r64, r64, imm8 L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  286. Inst 262 X86 : SHRD r16, r16, cl L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  287. Inst 263 X86 : SHRD r32, r32, cl L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  288. Inst 264 AMD64 : SHRD r64, r64, cl L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  289. Inst 265 X86 : ROL r8, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  290. Inst 266 X86 : ROL r16, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  291. Inst 267 X86 : ROL r32, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  292. Inst 268 AMD64 : ROL r64, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  293. Inst 269 X86 : ROL r8, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  294. Inst 270 X86 : ROL r16, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  295. Inst 271 X86 : ROL r32, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  296. Inst 272 AMD64 : ROL r64, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  297. Inst 273 X86 : ROL r8, cl L: 0.72ns= 1.5c T: 0.35ns= 0.74c
  298. Inst 274 X86 : ROL r16, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  299. Inst 275 X86 : ROL r32, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  300. Inst 276 AMD64 : ROL r64, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  301. Inst 277 X86 : ROR r8, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  302. Inst 278 X86 : ROR r16, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  303. Inst 279 X86 : ROR r32, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  304. Inst 280 AMD64 : ROR r64, 1 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  305. Inst 281 X86 : ROR r8, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  306. Inst 282 X86 : ROR r16, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  307. Inst 283 X86 : ROR r32, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  308. Inst 284 AMD64 : ROR r64, imm8 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  309. Inst 285 X86 : ROR r8, cl L: 0.72ns= 1.5c T: 0.35ns= 0.74c
  310. Inst 286 X86 : ROR r16, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  311. Inst 287 X86 : ROR r32, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  312. Inst 288 AMD64 : ROR r64, cl L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  313. Inst 289 X86 : RCL r8, 1 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  314. Inst 290 X86 : RCL r16, 1 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  315. Inst 291 X86 : RCL r32, 1 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  316. Inst 292 AMD64 : RCL r64, 1 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  317. Inst 293 X86 : RCL r8, imm8 L: 6.60ns= 13.8c T: 6.28ns= 13.17c
  318. Inst 294 X86 : RCL r16, imm8 L: 5.01ns= 10.5c T: 5.01ns= 10.50c
  319. Inst 295 X86 : RCL r32, imm8 L: 3.58ns= 7.5c T: 3.58ns= 7.50c
  320. Inst 296 AMD64 : RCL r64, imm8 L: 3.58ns= 7.5c T: 3.58ns= 7.50c
  321. Inst 297 X86 : RCL r8, cl L: 6.88ns= 14.4c T: 6.60ns= 13.83c
  322. Inst 298 X86 : RCL r16, cl L: 6.24ns= 13.1c T: 5.73ns= 12.00c
  323. Inst 299 X86 : RCL r32, cl L: 4.29ns= 9.0c T: 4.29ns= 9.00c
  324. Inst 300 AMD64 : RCL r64, cl L: 4.29ns= 9.0c T: 4.29ns= 9.00c
  325. Inst 301 X86 : RCR r8, 1 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  326. Inst 302 X86 : RCR r16, 1 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  327. Inst 303 X86 : RCR r32, 1 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  328. Inst 304 AMD64 : RCR r64, 1 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  329. Inst 305 X86 : RCR r8, imm8 L: 6.08ns= 12.8c T: 5.84ns= 12.25c
  330. Inst 306 X86 : RCR r16, imm8 L: 5.01ns= 10.5c T: 5.01ns= 10.50c
  331. Inst 307 X86 : RCR r32, imm8 L: 3.58ns= 7.5c T: 3.58ns= 7.50c
  332. Inst 308 AMD64 : RCR r64, imm8 L: 3.58ns= 7.5c T: 3.58ns= 7.50c
  333. Inst 309 X86 : RCR r8, cl L: 6.16ns= 12.9c T: 6.04ns= 12.67c
  334. Inst 310 X86 : RCR r16, cl L: 5.09ns= 10.7c T: 5.01ns= 10.50c
  335. Inst 311 X86 : RCR r32, cl L: 4.29ns= 9.0c T: 3.58ns= 7.50c
  336. Inst 312 AMD64 : RCR r64, cl L: 4.29ns= 9.0c T: 3.58ns= 7.50c
  337. Inst 313 X86 : BSF r16, r16 L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  338. Inst 314 X86 : BSF r32, r32 L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  339. Inst 315 AMD64 : BSF r64, r64 L: 2.86ns= 6.0c T: 2.15ns= 4.50c
  340. Inst 316 X86 : BSR r16, r16 L: 3.58ns= 7.5c T: 2.86ns= 6.00c
  341. Inst 317 X86 : BSR r32, r32 L: 3.58ns= 7.5c T: 2.86ns= 6.00c
  342. Inst 318 AMD64 : BSR r64, r64 L: 3.58ns= 7.5c T: 2.86ns= 6.00c
  343. Inst 319 X86 : BSWAP r32 L: 0.72ns= 1.5c T: 0.19ns= 0.41c
  344. Inst 320 AMD64 : BSWAP r64 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  345. Inst 321 MOVBE : MOVBE r16, [m16] L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  346. Inst 322 MOVBE : MOVBE r32, [m32] L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  347. Inst 323 MOVBE : MOVBE r64, [m64] L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  348. Inst 324 MOVBE : MOVBE [m16], r16 L: [memory dep.] T: 0.72ns= 1.50c
  349. Inst 325 MOVBE : MOVBE [m32], r32 L: [memory dep.] T: 0.72ns= 1.50c
  350. Inst 326 MOVBE : MOVBE [m64], r64 L: [memory dep.] T: 0.72ns= 1.50c
  351. Inst 327 X86 : IMUL r16, r16 L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  352. Inst 328 X86 : IMUL r32, r32 L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  353. Inst 329 AMD64 : IMUL r64, r64 L: 4.29ns= 9.0c T: 2.86ns= 6.00c
  354. Inst 330 X86 : IMUL r16, r16, imm8 L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  355. Inst 331 X86 : IMUL r32, r32, imm8 L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  356. Inst 332 AMD64 : IMUL r64, r64, imm8 L: 4.29ns= 9.0c T: 2.86ns= 6.00c
  357. Inst 333 X86 : IMUL r16, r16, imm16 L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  358. Inst 334 X86 : IMUL r32, r32, imm32 L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  359. Inst 335 AMD64 : IMUL r64, r64, imm32 L: 4.29ns= 9.0c T: 2.90ns= 6.08c
  360. Inst 336 X86 : IMUL r8 (ah) L: 2.86ns= 6.0c T: 2.86ns= 6.00c
  361. Inst 337 X86 : IMUL r16 (dx) L: 4.29ns= 9.0c T: 2.86ns= 6.00c
  362. Inst 338 X86 : IMUL r32 (edx) L: 3.58ns= 7.5c T: 2.86ns= 6.00c
  363. Inst 339 AMD64 : IMUL r64 (rdx) L: 5.01ns= 10.5c T: 4.29ns= 9.00c
  364. Inst 340 X86 : MUL r8 (ah) L: 2.86ns= 6.0c T: 2.86ns= 6.00c
  365. Inst 341 X86 : MUL r16 (dx) L: 4.29ns= 9.0c T: 2.86ns= 6.00c
  366. Inst 342 X86 : MUL r32 (edx) L: 3.58ns= 7.5c T: 2.86ns= 6.00c
  367. Inst 343 AMD64 : MUL r64 (rdx) L: 5.01ns= 10.5c T: 4.29ns= 9.00c
  368. Inst 344 X86 : IMUL r8 (al) L: 2.86ns= 6.0c T: 2.86ns= 6.00c
  369. Inst 345 X86 : IMUL r16 (ax) L: 2.86ns= 6.0c T: 2.86ns= 6.00c
  370. Inst 346 X86 : IMUL r32 (eax) L: 2.86ns= 6.0c T: 2.86ns= 6.00c
  371. Inst 347 AMD64 : IMUL r64 (rax) L: 4.29ns= 9.0c T: 4.29ns= 9.00c
  372. Inst 348 X86 : MUL r8 (al) L: 2.86ns= 6.0c T: 2.86ns= 6.00c
  373. Inst 349 X86 : MUL r16 (ax) L: 2.86ns= 6.0c T: 2.86ns= 6.00c
  374. Inst 350 X86 : MUL r32 (eax) L: 2.86ns= 6.0c T: 2.86ns= 6.00c
  375. Inst 351 AMD64 : MUL r64 (rax) L: 4.29ns= 9.0c T: 4.29ns= 9.00c
  376. Inst 352 X86 : IDIV r8 14/ 7b (full) L: 15.74ns= 33.0c T: 15.74ns= 33.00c
  377. Inst 353 X86 : IDIV r8 12/ 7b ax upd L: 13.60ns= 28.5c T: 13.60ns= 28.50c
  378. Inst 354 X86 : IDIV r8 7/ 7b ax upd L: 13.60ns= 28.5c T: 13.60ns= 28.50c
  379. Inst 355 X86 : IDIV r8 4/ 7b ax upd L: [no true dep.] T: 13.60ns= 28.50c
  380. Inst 356 X86 : IDIV r8 0/ 7b L: [no true dep.] T: 12.88ns= 27.00c
  381. Inst 357 X86 : IDIV r8 11/ 4b ax upd L: 13.60ns= 28.5c T: 13.60ns= 28.50c
  382. Inst 358 X86 : IDIV r8 8/ 4b ax upd L: [no true dep.] T: 13.60ns= 28.50c
  383. Inst 359 X86 : IDIV r8 4/ 4b ax upd L: 13.60ns= 28.5c T: 13.60ns= 28.50c
  384. Inst 360 X86 : IDIV r8 0/ 4b L: [no true dep.] T: 12.88ns= 27.00c
  385. Inst 361 X86 : IDIV r8 2^12/2^6 ax upd L: [no true dep.] T: 13.60ns= 28.50c
  386. Inst 362 X86 : IDIV r8 1/1 L: 12.88ns= 27.0c T: 12.88ns= 27.00c
  387. Inst 363 X86 : IDIV r8 1/1 ax upd L: 13.60ns= 28.5c T: 13.60ns= 28.50c
  388. Inst 364 X86 : IDIV r16 30/15b (full) L: 19.68ns= 41.3c T: 19.32ns= 40.50c
  389. Inst 365 X86 : IDIV r16 24/15b ax upd L: 15.74ns= 33.0c T: 15.39ns= 32.25c
  390. Inst 366 X86 : IDIV r16 15/15b ax upd L: 11.45ns= 24.0c T: 11.09ns= 23.25c
  391. Inst 367 X86 : IDIV r16 8/15b ax/dx upd L: [no true dep.] T: 11.45ns= 24.00c
  392. Inst 368 X86 : IDIV r16 0/15b L: [no true dep.] T: 10.89ns= 22.83c
  393. Inst 369 X86 : IDIV r16 23/ 8b ax upd L: 20.04ns= 42.0c T: 19.68ns= 41.25c
  394. Inst 370 X86 : IDIV r16 16/ 8b ax upd L: [no true dep.] T: 14.63ns= 30.67c
  395. Inst 371 X86 : IDIV r16 8/ 8b ax upd L: 11.45ns= 24.0c T: 11.09ns= 23.25c
  396. Inst 372 X86 : IDIV r16 0/ 8b L: [no true dep.] T: 10.89ns= 22.83c
  397. Inst 373 X86 : IDIV r16 2^28/2^14 ax/dx L: [no true dep.] T: 19.32ns= 40.50c
  398. Inst 374 X86 : IDIV r16 1/1 L: 11.09ns= 23.3c T: 10.89ns= 22.83c
  399. Inst 375 X86 : IDIV r16 1/1 ax upd L: 11.45ns= 24.0c T: 11.09ns= 23.25c
  400. Inst 376 X86 : IDIV r16 1/1 ax/dx upd L: 11.81ns= 24.8c T: 11.49ns= 24.08c
  401. Inst 377 X86 : IDIV r32 62/31b (full) L: 30.06ns= 63.0c T: 30.06ns= 63.00c
  402. Inst 378 X86 : IDIV r32 62/31b 0 rem. L: 30.06ns= 63.0c T: 30.06ns= 63.00c
  403. Inst 379 X86 : IDIV r32 48/31b eax upd L: 20.04ns= 42.0c T: 20.04ns= 42.00c
  404. Inst 380 X86 : IDIV r32 31/31b eax upd L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  405. Inst 381 X86 : IDIV r32 16/31b eax/edx L: [no true dep.] T: 10.38ns= 21.75c
  406. Inst 382 X86 : IDIV r32 0/31b L: [no true dep.] T: 10.02ns= 21.00c
  407. Inst 383 X86 : IDIV r32 47/16b eax upd L: 30.06ns= 63.0c T: 30.06ns= 63.00c
  408. Inst 384 X86 : IDIV r32 32/16b eax upd L: [no true dep.] T: 19.32ns= 40.50c
  409. Inst 385 X86 : IDIV r32 16/16b eax upd L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  410. Inst 386 X86 : IDIV r32 0/16b L: [no true dep.] T: 10.02ns= 21.00c
  411. Inst 387 X86 : IDIV r32 2^60/2^30 eax/edx L: [no true dep.] T: 29.34ns= 61.50c
  412. Inst 388 X86 : IDIV r32 1/1 L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  413. Inst 389 X86 : IDIV r32 1/1 eax upd L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  414. Inst 390 X86 : IDIV r32 1/1 eax/edx upd L: 10.38ns= 21.8c T: 10.38ns= 21.75c
  415. Inst 391 AMD64 : IDIV r64 126/63b (full) L: 52.96ns=111.0c T: 52.96ns=111.00c
  416. Inst 392 AMD64 : IDIV r64 126/63b 0 rem. L: 52.96ns=111.0c T: 52.96ns=111.00c
  417. Inst 393 AMD64 : IDIV r64 96/63b rax upd L: 31.49ns= 66.0c T: 31.49ns= 66.00c
  418. Inst 394 AMD64 : IDIV r64 63/63b rax upd L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  419. Inst 395 AMD64 : IDIV r64 32/63b rax/rdx L: [no true dep.] T: 10.10ns= 21.17c
  420. Inst 396 AMD64 : IDIV r64 0/63b L: [no true dep.] T: 10.02ns= 21.00c
  421. Inst 397 AMD64 : IDIV r64 95/32b rax upd L: 52.96ns=111.0c T: 52.96ns=111.00c
  422. Inst 398 AMD64 : IDIV r64 64/32b rax upd L: [no true dep.] T: 30.77ns= 64.50c
  423. Inst 399 AMD64 : IDIV r64 32/32b rax upd L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  424. Inst 400 AMD64 : IDIV r64 0/32b L: [no true dep.] T: 10.02ns= 21.00c
  425. Inst 401 AMD64 : IDIV r64 2^124/2^62 rax/rdx L: [no true dep.] T: 52.28ns=109.58c
  426. Inst 402 AMD64 : IDIV r64 1/1 L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  427. Inst 403 AMD64 : IDIV r64 1/1 rax upd L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  428. Inst 404 AMD64 : IDIV r64 1/1 rax/rdx upd L: 10.18ns= 21.3c T: 10.18ns= 21.33c
  429. Inst 405 X86 : DIV r8 16/ 8b (full) L: 15.74ns= 33.0c T: 15.74ns= 33.00c
  430. Inst 406 X86 : DIV r8 12/ 8b ax upd L: 13.60ns= 28.5c T: 13.60ns= 28.50c
  431. Inst 407 X86 : DIV r8 8/ 8b ax upd L: 13.60ns= 28.5c T: 13.60ns= 28.50c
  432. Inst 408 X86 : DIV r8 4/ 8b ax upd L: [no true dep.] T: 13.60ns= 28.50c
  433. Inst 409 X86 : DIV r8 0/ 8b L: [no true dep.] T: 12.88ns= 27.00c
  434. Inst 410 X86 : DIV r8 12/ 4b ax upd L: 13.60ns= 28.5c T: 13.60ns= 28.50c
  435. Inst 411 X86 : DIV r8 8/ 4b ax upd L: [no true dep.] T: 13.60ns= 28.50c
  436. Inst 412 X86 : DIV r8 4/ 4b ax upd L: 13.60ns= 28.5c T: 13.60ns= 28.50c
  437. Inst 413 X86 : DIV r8 0/ 4b L: [no true dep.] T: 12.88ns= 27.00c
  438. Inst 414 X86 : DIV r8 2^14/2^7 ax upd L: [no true dep.] T: 13.60ns= 28.50c
  439. Inst 415 X86 : DIV r8 1/1 L: 12.88ns= 27.0c T: 12.88ns= 27.00c
  440. Inst 416 X86 : DIV r8 1/1 ax upd L: 13.60ns= 28.5c T: 13.60ns= 28.50c
  441. Inst 417 X86 : DIV r16 32/16b (full) L: 19.68ns= 41.3c T: 19.32ns= 40.50c
  442. Inst 418 X86 : DIV r16 30/15b 0 rem. L: 19.68ns= 41.3c T: 19.32ns= 40.50c
  443. Inst 419 X86 : DIV r16 24/16b ax upd L: 15.03ns= 31.5c T: 14.63ns= 30.67c
  444. Inst 420 X86 : DIV r16 16/16b ax upd L: 11.45ns= 24.0c T: 11.09ns= 23.25c
  445. Inst 421 X86 : DIV r16 8/16b ax/dx upd L: [no true dep.] T: 11.49ns= 24.08c
  446. Inst 422 X86 : DIV r16 0/16b L: [no true dep.] T: 10.85ns= 22.75c
  447. Inst 423 X86 : DIV r16 24/ 8b ax upd L: 20.04ns= 42.0c T: 19.68ns= 41.25c
  448. Inst 424 X86 : DIV r16 16/ 8b ax upd L: [no true dep.] T: 14.63ns= 30.67c
  449. Inst 425 X86 : DIV r16 8/ 8b ax upd L: 11.45ns= 24.0c T: 11.09ns= 23.25c
  450. Inst 426 X86 : DIV r16 0/ 8b L: [no true dep.] T: 10.85ns= 22.75c
  451. Inst 427 X86 : DIV r16 1/1 L: 11.09ns= 23.3c T: 10.85ns= 22.75c
  452. Inst 428 X86 : DIV r16 1/1 ax upd L: 11.45ns= 24.0c T: 11.09ns= 23.25c
  453. Inst 429 X86 : DIV r16 1/1 ax/dx upd L: 11.81ns= 24.8c T: 11.49ns= 24.08c
  454. Inst 430 X86 : DIV r32 64/32b (full) L: 30.06ns= 63.0c T: 30.06ns= 63.00c
  455. Inst 431 X86 : DIV r32 62/31b 0 rem. L: 30.06ns= 63.0c T: 30.06ns= 63.00c
  456. Inst 432 X86 : DIV r32 48/32b eax upd L: 19.32ns= 40.5c T: 19.32ns= 40.50c
  457. Inst 433 X86 : DIV r32 32/32b eax upd L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  458. Inst 434 X86 : DIV r32 16/32b eax/edx L: [no true dep.] T: 10.38ns= 21.75c
  459. Inst 435 X86 : DIV r32 0/32b L: [no true dep.] T: 10.02ns= 21.00c
  460. Inst 436 X86 : DIV r32 48/16b eax upd L: 30.06ns= 63.0c T: 30.06ns= 63.00c
  461. Inst 437 X86 : DIV r32 32/16b eax upd L: [no true dep.] T: 19.32ns= 40.50c
  462. Inst 438 X86 : DIV r32 16/16b eax upd L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  463. Inst 439 X86 : DIV r32 0/16b L: [no true dep.] T: 10.02ns= 21.00c
  464. Inst 440 X86 : DIV r32 2^62/2^31 eax/edx L: [no true dep.] T: 30.06ns= 63.00c
  465. Inst 441 X86 : DIV r32 1/1 L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  466. Inst 442 X86 : DIV r32 1/1 eax upd L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  467. Inst 443 X86 : DIV r32 1/1 eax/edx upd L: 10.38ns= 21.8c T: 10.38ns= 21.75c
  468. Inst 444 AMD64 : DIV r64 128/64b (full) L: 52.96ns=111.0c T: 52.96ns=111.00c
  469. Inst 445 AMD64 : DIV r64 126/63b 0 rem. L: 52.96ns=111.0c T: 52.96ns=111.00c
  470. Inst 446 AMD64 : DIV r64 96/64b rax upd L: 30.77ns= 64.5c T: 30.77ns= 64.50c
  471. Inst 447 AMD64 : DIV r64 64/64b rax upd L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  472. Inst 448 AMD64 : DIV r64 32/64b rax/rdx L: [no true dep.] T: 10.22ns= 21.42c
  473. Inst 449 AMD64 : DIV r64 0/64b L: [no true dep.] T: 10.02ns= 21.00c
  474. Inst 450 AMD64 : DIV r64 96/32b rax upd L: 52.96ns=111.0c T: 52.96ns=111.00c
  475. Inst 451 AMD64 : DIV r64 64/32b rax upd L: [no true dep.] T: 30.81ns= 64.58c
  476. Inst 452 AMD64 : DIV r64 32/32b rax upd L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  477. Inst 453 AMD64 : DIV r64 0/32b L: [no true dep.] T: 10.02ns= 21.00c
  478. Inst 454 AMD64 : DIV r64 2^126/2^63 rax/rdx L: [no true dep.] T: 53.00ns=111.08c
  479. Inst 455 AMD64 : DIV r64 1/1 L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  480. Inst 456 AMD64 : DIV r64 1/1 rax upd L: 10.02ns= 21.0c T: 10.02ns= 21.00c
  481. Inst 457 AMD64 : DIV r64 1/1 rax/rdx upd L: 10.18ns= 21.3c T: 10.18ns= 21.33c
  482. Inst 458 X86 : CBW L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  483. Inst 459 X86 : CWDE L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  484. Inst 460 AMD64 : CDQE L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  485. Inst 461 X86 : CWD L: 0.80ns= 1.7c T: 0.76ns= 1.58c
  486. Inst 462 X86 : CDQ L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  487. Inst 463 AMD64 : CQO L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  488. Inst 464 X86 : CLC L: 0.19ns= 0.4c T: 0.18ns= 0.39c
  489. Inst 465 X86 : STC L: 0.19ns= 0.4c T: 0.18ns= 0.39c
  490. Inst 466 X86 : CMC L: 0.64ns= 1.3c T: 0.64ns= 1.33c
  491. Inst 467 X86 : CLD L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  492. Inst 468 X86 : STD L: 2.86ns= 6.0c T: 2.86ns= 6.00c
  493. Inst 475 LAHF : LAHF L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  494. Inst 476 LAHF : SAHF L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  495. Inst 483 X86 : PUSH r16 L: [no true dep.] T: 0.72ns= 1.50c
  496. Inst 484 X86 : POP r16 L: [no true dep.] T: 0.72ns= 1.50c
  497. Inst 485 X86 : PUSH r16 + POP r16 L: 8.43ns= 17.7c T: 0.56ns= 1.17c
  498. Inst 486 AMD64 : PUSH r64 L: [no true dep.] T: 0.80ns= 1.67c
  499. Inst 487 AMD64 : POP r64 L: [no true dep.] T: 0.72ns= 1.50c
  500. Inst 488 AMD64 : PUSH r64 + POP r64 L: 3.34ns= 7.0c T: 1.47ns= 3.08c
  501. Inst 489 AMD64 : PUSH imm8 L: [no true dep.] T: 0.83ns= 1.75c
  502. Inst 490 AMD64 : PUSH imm8 + POP r64 L: 1.35ns= 2.8c T: 1.43ns= 3.00c
  503. Inst 491 AMD64 : PUSH imm32 L: [no true dep.] T: 0.83ns= 1.75c
  504. Inst 492 AMD64 : PUSH imm32 + POP r64 L: 1.39ns= 2.9c T: 1.43ns= 3.00c
  505. Inst 493 X86 : PUSH [m16] L: [no true dep.] T: 0.83ns= 1.75c
  506. Inst 494 X86 : POP [m16] L: [no true dep.] T: 0.87ns= 1.83c
  507. Inst 495 X86 : PUSH [m16] + POP [m16] L: 11.45ns= 24.0c T: 1.35ns= 2.83c
  508. Inst 496 AMD64 : PUSH [m64] L: [no true dep.] T: 0.87ns= 1.83c
  509. Inst 497 AMD64 : POP [m64] L: [no true dep.] T: 0.80ns= 1.67c
  510. Inst 498 AMD64 : PUSH [m64] + POP [m64] L: 10.66ns= 22.3c T: 1.19ns= 2.50c
  511. Inst 499 X86 : PUSHF L: [no true dep.] T: 2.15ns= 4.50c
  512. Inst 501 X86 : PUSHF + POPF L: 17.61ns= 36.9c T: 17.61ns= 36.92c
  513. Inst 502 AMD64 : PUSHFQ L: [no true dep.] T: 2.15ns= 4.50c
  514. Inst 504 AMD64 : PUSHFQ + POPFQ L: 17.61ns= 36.9c T: 17.61ns= 36.92c
  515. Inst 505 X86 : CMPSB L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  516. Inst 506 X86 : CMPSW L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  517. Inst 507 X86 : CMPSD L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  518. Inst 508 AMD64 : CMPSQ L: 2.15ns= 4.5c T: 2.19ns= 4.58c
  519. Inst 509 X86 : REPE CMPSB BW in L1D: 0.33 B/c 684MiB/s
  520. Inst 510 X86 : REPE CMPSW BW in L1D: 0.65 B/c 1365MiB/s
  521. Inst 511 X86 : REPE CMPSD BW in L1D: 1.30 B/c 2725MiB/s
  522. Inst 512 AMD64 : REPE CMPSQ BW in L1D: 2.59 B/c 5432MiB/s
  523. Inst 513 X86 : LODSB L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  524. Inst 514 X86 : LODSW L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  525. Inst 515 X86 : LODSD L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  526. Inst 516 AMD64 : LODSQ L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  527. Inst 517 X86 : REP LODSB BW in L1D: 0.31 B/c 642MiB/s
  528. Inst 518 X86 : REP LODSW BW in L1D: 0.61 B/c 1284MiB/s
  529. Inst 519 X86 : REP LODSD BW in L1D: 1.28 B/c 2681MiB/s
  530. Inst 520 AMD64 : REP LODSQ BW in L1D: 2.55 B/c 5352MiB/s
  531. Inst 521 X86 : STOSB L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  532. Inst 522 X86 : STOSW L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  533. Inst 523 X86 : STOSD L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  534. Inst 524 AMD64 : STOSQ L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  535. Inst 525 X86 : REP STOSB BW in L1D: 5.52 B/c 11574MiB/s
  536. Inst 526 X86 : REP STOSW BW in L1D: 5.52 B/c 11576MiB/s
  537. Inst 527 X86 : REP STOSD BW in L1D: 5.53 B/c 11598MiB/s
  538. Inst 528 AMD64 : REP STOSQ BW in L1D: 5.54 B/c 11610MiB/s
  539. Inst 529 X86 : MOVSB L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  540. Inst 530 X86 : MOVSW L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  541. Inst 531 X86 : MOVSD L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  542. Inst 532 AMD64 : MOVSQ L: 2.15ns= 4.5c T: 2.19ns= 4.58c
  543. Inst 533 X86 : REP MOVSB BW in L1D:11.07 B/c 23194MiB/s
  544. Inst 534 X86 : REP MOVSW BW in L1D:10.98 B/c 23025MiB/s
  545. Inst 535 X86 : REP MOVSD BW in L1D:10.98 B/c 23010MiB/s
  546. Inst 536 AMD64 : REP MOVSQ BW in L1D:11.00 B/c 23063MiB/s
  547. Inst 537 X86 : SCASB L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  548. Inst 538 X86 : SCASW L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  549. Inst 539 X86 : SCASD L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  550. Inst 540 AMD64 : SCASQ L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  551. Inst 541 X86 : REPNE SCASB BW in L1D: 0.20 B/c 414MiB/s
  552. Inst 542 X86 : REPNE SCASW BW in L1D: 0.40 B/c 828MiB/s
  553. Inst 543 X86 : REPNE SCASD BW in L1D: 0.80 B/c 1682MiB/s
  554. Inst 544 AMD64 : REPNE SCASQ BW in L1D: 1.58 B/c 3310MiB/s
  555. Inst 545 X86 : XADD r8, r8 L: 1.43ns= 3.0c T: 0.21ns= 0.45c
  556. Inst 546 X86 : XADD r16, r16 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  557. Inst 547 X86 : XADD r32, r32 L: 1.07ns= 2.3c T: 0.36ns= 0.75c
  558. Inst 548 AMD64 : XADD r64, r64 L: 1.07ns= 2.3c T: 0.36ns= 0.75c
  559. Inst 549 X86 : CMPXCHG r8, r8 L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  560. Inst 550 X86 : CMPXCHG r16, r16 L: 3.58ns= 7.5c T: 2.15ns= 4.50c
  561. Inst 551 X86 : CMPXCHG r32, r32 L: 3.58ns= 7.5c T: 2.15ns= 4.50c
  562. Inst 552 AMD64 : CMPXCHG r64, r64 L: 3.58ns= 7.5c T: 2.15ns= 4.50c
  563. Inst 553 CMPX8 : CMPXCHG8B L: 10.66ns= 22.3c T: 4.73ns= 9.92c
  564. Inst 554 CMPX16: CMPXCHG16B L: 30.06ns= 63.0c T: 30.06ns= 63.00c
  565. Inst 555 X86 : RDTSC L: [no true dep.] T: 56.38ns=118.17c
  566. Inst 556 X86 : CPUID (EAX = 0) L: 82.30ns=172.5c T: 82.30ns=172.50c
  567. Inst 557 X86 : CPUID (EAX = 1) L: 208.93ns=437.9c T: 208.93ns=437.92c
  568. Inst 558 POPCNT: POPCNT r16, r16 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  569. Inst 559 POPCNT: POPCNT r32, r32 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  570. Inst 560 POPCNT: POPCNT r64, r64 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  571. Inst 561 ABM : LZCNT r16, r16 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  572. Inst 562 ABM : LZCNT r32, r32 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  573. Inst 563 ABM : LZCNT r64, r64 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  574. Inst 564 SSE4.2: CRC32 r32, r8 L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  575. Inst 565 SSE4.2: CRC32 r32, r16 L: 3.58ns= 7.5c T: 3.58ns= 7.50c
  576. Inst 566 SSE4.2: CRC32 r32, r32 L: 4.29ns= 9.0c T: 4.29ns= 9.00c
  577. Inst 567 SSE4.2: CRC32 r64, r8 L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  578. Inst 568 SSE4.2: CRC32 r64, r16 L: 7.16ns= 15.0c T: 5.88ns= 12.33c
  579. Inst 569 X87 : FNOP L: [no true dep.] T: 0.18ns= 0.38c
  580. Inst 570 X87 : FXCH st(i) L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  581. Inst 571 X87 : FCHS L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  582. Inst 572 X87 : FABS L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  583. Inst 573 X87 : FTST L: [no true dep.] T: 0.36ns= 0.75c
  584. Inst 574 X87 : FXAM L: [no true dep.] T: 0.36ns= 0.75c
  585. Inst 575 CMOV : FCMOVE st, st(i) L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  586. Inst 576 X87 : FADD st(i), st (st = 0.0) L: 3.58ns= 7.5c T: 0.48ns= 1.00c
  587. Inst 577 X87 : FADD st(i), st L: 3.58ns= 7.5c T: 0.48ns= 1.00c
  588. Inst 578 X87 : FADD st, st(i), FXCH st(i) L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  589. Inst 579 X87 : FMUL st(i), st (st = 0.0) L: 3.58ns= 7.5c T: 0.48ns= 1.00c
  590. Inst 580 X87 : FMUL st(i), st L: 3.58ns= 7.5c T: 0.48ns= 1.00c
  591. Inst 581 X87 : FMUL st, st(i), FXCH st(i) L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  592. Inst 582 X87 : FMUL + FADD st, st(i) L: 7.16ns= 15.0c T: [not enough reg]
  593. Inst 583 X87 : FMUL st(2i) FADD st(2i+1) L: 3.58ns= 7.5c T: [not enough reg]
  594. Inst 584 X87 : FDIV32 st(i), st L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  595. Inst 585 X87 : FDIV64 st(i), st L: 17.18ns= 36.0c T: 6.80ns= 14.25c
  596. Inst 586 X87 : FDIV80 st(i), st L: 19.32ns= 40.5c T: 7.87ns= 16.50c
  597. Inst 587 X87 : FDIV80 (0.0l/x) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  598. Inst 588 X87 : FDIV80 (x/1.0l) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  599. Inst 589 X87 : FDIV80 (x/2.0l) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  600. Inst 590 X87 : FDIV80 (x/0.5l) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  601. Inst 591 X87 : FSQRT32 st L: 10.73ns= 22.5c T: 3.58ns= 7.50c
  602. Inst 592 X87 : FSQRT64 st L: 17.89ns= 37.5c T: 7.16ns= 15.00c
  603. Inst 593 X87 : FSQRT80 st L: 20.04ns= 42.0c T: 8.23ns= 17.25c
  604. Inst 594 X87 : FSQRT80 (0.0l) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  605. Inst 595 X87 : FSQRT80 (1.0l) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  606. Inst 596 X87 : FDECSTP L: [no true dep.] T: 0.18ns= 0.38c
  607. Inst 597 X87 : FINCSTP L: [no true dep.] T: 0.18ns= 0.38c
  608. Inst 598 X87 : FCOM st(i) L: [no true dep.] T: 0.36ns= 0.75c
  609. Inst 599 CMOV : FCOMI st, st(i) L: [no true dep.] T: 0.72ns= 1.50c
  610. Inst 600 X87 : FSIN80 (0.0) L: 54.11ns=113.4c T: 54.91ns=115.08c
  611. Inst 601 X87 : FSIN80 (0.0) + FADD L: 58.60ns=122.8c T: 54.19ns=113.58c
  612. Inst 602 X87 : FSIN80 (1.0) + FADD L: 116.89ns=245.0c T: 85.44ns=179.08c
  613. Inst 603 X87 : FSIN80 (4Pi) + FADD L: 139.03ns=291.4c T: 100.63ns=210.92c
  614. Inst 604 X87 : FSIN80 (2Pi) + FADD L: 139.03ns=291.4c T: 100.63ns=210.92c
  615. Inst 605 X87 : FSIN80 (Pi) + FADD L: 138.36ns=290.0c T: 99.55ns=208.67c
  616. Inst 606 X87 : FSIN80 (Pi/2) + FADD L: 138.36ns=290.0c T: 99.59ns=208.75c
  617. Inst 607 X87 : FSIN80 (Pi/4) + FADD L: 116.09ns=243.3c T: 83.81ns=175.67c
  618. Inst 608 X87 : FSIN80 (Pi/8) + FADD L: 113.97ns=238.8c T: 77.71ns=162.83c
  619. Inst 609 X87 : FSIN80 (Pi/16) + FADD L: 114.07ns=239.1c T: 77.65ns=162.75c
  620. Inst 610 X87 : FSIN80 (Pi/32) + FADD L: 113.95ns=238.8c T: 77.69ns=162.83c
  621. Inst 611 X87 : FCOS80 (0.73908513...) L: 112.55ns=235.9c T: 79.83ns=167.33c
  622. Inst 612 X87 : FCOS80 (0.73908513...)+FADD L: 114.50ns=240.0c T: 81.22ns=170.25c
  623. Inst 613 X87 : FCOS80 (0.0) + FADD L: 59.32ns=124.3c T: 54.19ns=113.58c
  624. Inst 614 X87 : FCOS80 (1.0) + FADD L: 116.65ns=244.5c T: 85.48ns=179.17c
  625. Inst 615 X87 : FCOS80 (4Pi) + FADD L: 138.84ns=291.0c T: 100.11ns=209.83c
  626. Inst 616 X87 : FCOS80 (2Pi) + FADD L: 138.84ns=291.0c T: 101.03ns=211.75c
  627. Inst 617 X87 : FCOS80 (Pi) + FADD L: 139.55ns=292.5c T: 101.70ns=213.17c
  628. Inst 618 X87 : FCOS80 (Pi/2) + FADD L: 138.12ns=289.5c T: 100.63ns=210.92c
  629. Inst 619 X87 : FCOS80 (Pi/4) + FADD L: 115.94ns=243.0c T: 84.53ns=177.17c
  630. Inst 620 X87 : FCOS80 (Pi/8) + FADD L: 113.83ns=238.6c T: 77.69ns=162.83c
  631. Inst 621 X87 : FCOS80 (Pi/16) + FADD L: 113.79ns=238.5c T: 77.73ns=162.92c
  632. Inst 622 X87 : FCOS80 (Pi/32) + FADD L: 113.79ns=238.5c T: 77.69ns=162.83c
  633. Inst 623 MMX : EMMS L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  634. Inst 624 MMX : MOVD r32, mm L: [diff. reg. set] T: 0.72ns= 1.50c
  635. Inst 625 MMX : MOVD mm, r32 L: [diff. reg. set] T: 0.72ns= 1.50c
  636. Inst 626 MMX : MOVD r32, mm+MOVD mm, r32 L: 12.88ns= 27.0c T: 0.48ns= 1.00c
  637. Inst 627 AMD64 : MOVD r64, mm L: [diff. reg. set] T: 0.72ns= 1.50c
  638. Inst 628 AMD64 : MOVD mm, r64 L: [diff. reg. set] T: 0.72ns= 1.50c
  639. Inst 629 AMD64 : MOVD r64, mm+MOVD mm, r64 L: 12.88ns= 27.0c T: 0.48ns= 1.00c
  640. Inst 630 MMX : MOVD mm, [m32] L: [memory dep.] T: 0.36ns= 0.75c
  641. Inst 631 MMX : MOVD [m32], mm L: [memory dep.] T: 0.72ns= 1.50c
  642. Inst 632 MMX : MOVD mm,[m32]+MOVD [m32],mm L: 7.87ns= 16.5c T: 0.91ns= 1.92c
  643. Inst 633 MMX : MOVQ mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  644. Inst 634 MMX : MOVQ mm, [m64] L: [memory dep.] T: 0.36ns= 0.75c
  645. Inst 635 MMX : MOVQ [m64], mm L: [memory dep.] T: 0.72ns= 1.50c
  646. Inst 636 MMX : MOVQ mm,[m64]+MOVQ [m64],mm L: 7.87ns= 16.5c T: 1.31ns= 2.75c
  647. Inst 637 MMXP : MOVNTQ [m64], mm L: [memory dep.] T: 2.17ns= 2.17c
  648. Inst 638 MMXP : PMOVMSKB r32, mm L: [diff. reg. set] T: 0.72ns= 1.50c
  649. Inst 639 AMD64 : PMOVMSKB r64, mm L: [diff. reg. set] T: 0.72ns= 1.50c
  650. Inst 640 MMXP : MASKMOVQ mm, mm L: [memory dep.] T: 25.50ns= 25.50c
  651. Inst 641 MMX : PADDB mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  652. Inst 642 MMX : PADDW mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  653. Inst 643 MMX : PADDD mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  654. Inst 644 SSE2 : PADDQ mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  655. Inst 645 MMX : PADDSB mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  656. Inst 646 MMX : PADDSW mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  657. Inst 647 MMX : PADDUSB mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  658. Inst 648 MMX : PADDUSW mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  659. Inst 649 MMX : PSUBB mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  660. Inst 650 MMX : PSUBB mm1, mm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  661. Inst 651 MMX : PSUBW mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  662. Inst 652 MMX : PSUBW mm1, mm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  663. Inst 653 MMX : PSUBD mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  664. Inst 654 MMX : PSUBD mm1, mm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  665. Inst 655 SSE2 : PSUBQ mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  666. Inst 656 SSE2 : PSUBQ mm1, mm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  667. Inst 657 MMX : PSUBSB mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  668. Inst 658 MMX : PSUBSB mm1, mm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  669. Inst 659 MMX : PSUBSW mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  670. Inst 660 MMX : PSUBSW mm1, mm2 L: 1.43ns= 3.0c T: 0.35ns= 0.73c
  671. Inst 661 MMX : PSUBUSB mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  672. Inst 662 MMX : PSUBUSB mm1, mm2 L: 1.43ns= 3.0c T: 0.35ns= 0.73c
  673. Inst 663 MMX : PSUBUSW mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  674. Inst 664 MMX : PSUBUSW mm1, mm2 L: 1.43ns= 3.0c T: 0.35ns= 0.73c
  675. Inst 665 MMX : PCMPEQB mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  676. Inst 666 MMX : PCMPEQB mm1, mm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  677. Inst 667 MMX : PCMPEQW mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  678. Inst 668 MMX : PCMPEQW mm1, mm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  679. Inst 669 MMX : PCMPEQD mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  680. Inst 670 MMX : PCMPEQD mm1, mm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  681. Inst 671 MMX : PCMPGTB mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  682. Inst 672 MMX : PCMPGTB mm1, mm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  683. Inst 673 MMX : PCMPGTW mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  684. Inst 674 MMX : PCMPGTW mm1, mm2 L: 1.43ns= 3.0c T: 0.35ns= 0.73c
  685. Inst 675 MMX : PCMPGTD mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  686. Inst 676 MMX : PCMPGTD mm1, mm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  687. Inst 677 MMX : PAND mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  688. Inst 678 MMX : PAND mm1, mm2 L: 1.43ns= 3.0c T: 0.35ns= 0.74c
  689. Inst 679 MMX : PANDN mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  690. Inst 680 MMX : PANDN mm1, mm2 L: 1.43ns= 3.0c T: 0.34ns= 0.72c
  691. Inst 681 MMX : POR mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  692. Inst 682 MMX : POR mm1, mm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  693. Inst 683 MMX : PXOR mm, mm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  694. Inst 684 MMX : PXOR mm1, mm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  695. Inst 685 MMX : PMULHW mm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  696. Inst 686 MMXP : PMULHUW mm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  697. Inst 688 SSSE3 : PMULHRSW mm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  698. Inst 689 MMX : PMULLW mm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  699. Inst 690 SSE2 : PMULUDQ mm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  700. Inst 691 SSSE3 : PMADDUBSW mm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  701. Inst 692 MMX : PMADDWD mm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  702. Inst 693 MMX : PSLLW mm, mm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  703. Inst 694 MMX : PSLLW mm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  704. Inst 695 MMX : PSLLD mm, mm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  705. Inst 696 MMX : PSLLD mm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  706. Inst 697 MMX : PSLLQ mm, mm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  707. Inst 698 MMX : PSLLQ mm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  708. Inst 699 MMX : PSRAW mm, mm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  709. Inst 700 MMX : PSRAW mm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  710. Inst 701 MMX : PSRAD mm, mm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  711. Inst 702 MMX : PSRAD mm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  712. Inst 703 MMX : PSRLW mm, mm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  713. Inst 704 MMX : PSRLW mm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  714. Inst 705 MMX : PSRLD mm, mm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  715. Inst 706 MMX : PSRLD mm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  716. Inst 707 MMX : PSRLQ mm, mm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  717. Inst 708 MMX : PSRLQ mm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  718. Inst 709 MMX : PUNPCKHBW mm, mm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  719. Inst 710 MMX : PUNPCKHWD mm, mm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  720. Inst 711 MMX : PUNPCKHDQ mm, mm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  721. Inst 712 MMX : PUNPCKLBW mm, mm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  722. Inst 713 MMX : PUNPCKLWD mm, mm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  723. Inst 714 MMX : PUNPCKLDQ mm, mm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  724. Inst 715 MMX : PACKSSWB mm, mm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  725. Inst 716 MMX : PACKUSWB mm, mm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  726. Inst 717 MMX : PACKSSDW mm, mm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  727. Inst 753 MMXP : PAVGB mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  728. Inst 754 MMXP : PAVGW mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  729. Inst 755 MMXP : PEXTRW r32, mm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  730. Inst 756 MMXP : PINSRW mm, r32, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  731. Inst 757 MMXP : PEXTRW + PINSRW r32 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  732. Inst 758 AMD64 : PEXTRW r64, mm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  733. Inst 759 AMD64 : PINSRW mm, r64, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  734. Inst 760 AMD64 : PEXTRW + PINSRW r64 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  735. Inst 761 MMXP : PMAXSW mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  736. Inst 762 MMXP : PMAXUB mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  737. Inst 763 MMXP : PMINSW mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  738. Inst 764 MMXP : PMINUB mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  739. Inst 765 MMXP : PSADBW mm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  740. Inst 766 MMXP : PSHUFW mm, mm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  741. Inst 767 MMXP : PREFETCHNTA [mem] L: [memory dep.] T: 0.36ns= 0.75c
  742. Inst 768 MMXP : PREFETCHT0 [mem] L: [memory dep.] T: 0.36ns= 0.75c
  743. Inst 769 MMXP : PREFETCHT1 [mem] L: [memory dep.] T: 0.36ns= 0.75c
  744. Inst 770 MMXP : PREFETCHT2 [mem] L: [memory dep.] T: 0.36ns= 0.75c
  745. Inst 771 MMXP : SFENCE L: 63.73ns=133.6c T: 63.73ns=133.58c
  746. Inst 772 SSE2 : LFENCE L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  747. Inst 773 SSE2 : MFENCE L: 63.73ns=133.6c T: 63.73ns=133.58c
  748. Inst 774 SSSE3 : PABSB mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  749. Inst 775 SSSE3 : PABSW mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  750. Inst 776 SSSE3 : PABSD mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  751. Inst 777 SSSE3 : PALIGNR mm, mm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  752. Inst 778 SSSE3 : PHADDW mm, mm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  753. Inst 779 SSSE3 : PHADDD mm, mm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  754. Inst 780 SSSE3 : PHADDSW mm, mm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  755. Inst 781 SSSE3 : PHSUBW mm, mm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  756. Inst 782 SSSE3 : PHSUBD mm, mm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  757. Inst 783 SSSE3 : PHSUBSW mm, mm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  758. Inst 784 SSSE3 : PSHUFB mm, mm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  759. Inst 785 SSSE3 : PSIGNB mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  760. Inst 786 SSSE3 : PSIGNW mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  761. Inst 787 SSSE3 : PSIGND mm, mm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  762. Inst 788 SSE : MOVHLPS xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  763. Inst 789 SSE : MOVHLPS xmm1, xmm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  764. Inst 790 AVX : VMOVHLPS xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  765. Inst 791 AVX : VMOVHLPS xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  766. Inst 792 SSE : MOVSS xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  767. Inst 793 AVX : VMOVSS xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  768. Inst 794 SSE : MOVSS xmm, [m32] L: [memory dep.] T: 0.36ns= 0.75c
  769. Inst 795 SSE : MOVSS [m32], xmm L: [memory dep.] T: 0.72ns= 1.50c
  770. Inst 796 SSE : MOVSS LS pair L: 7.87ns= 16.5c T: 0.48ns= 1.00c
  771. Inst 797 AVX : VMOVSS xmm, [m32] L: [memory dep.] T: 0.36ns= 0.75c
  772. Inst 798 AVX : VMOVSS [m32], xmm L: [memory dep.] T: 0.72ns= 1.50c
  773. Inst 799 AVX : VMOVSS LS pair L: 7.87ns= 16.5c T: 0.72ns= 1.50c
  774. Inst 800 SSE : MOVLPS xmm, [m32] L: [memory dep.] T: 0.36ns= 0.75c
  775. Inst 801 SSE : MOVLPS [m32], xmm L: [memory dep.] T: 0.72ns= 1.50c
  776. Inst 802 SSE : MOVLPS LS pair L: 9.30ns= 19.5c T: 0.48ns= 1.00c
  777. Inst 803 AVX : VMOVLPS xmm, xmm, [m32] L: [memory dep.] T: 0.36ns= 0.75c
  778. Inst 804 AVX : VMOVLPS [m32], xmm L: [memory dep.] T: 0.72ns= 1.50c
  779. Inst 805 AVX : VMOVLPS LS pair L: 9.30ns= 19.5c T: 0.17ns= 0.36c
  780. Inst 806 SSE : MOVHPS xmm, [m32] L: [memory dep.] T: 0.72ns= 1.50c
  781. Inst 807 SSE : MOVHPS [m32], xmm L: [memory dep.] T: 1.43ns= 3.00c
  782. Inst 808 SSE : MOVHPS LS pair L: 10.73ns= 22.5c T: 2.15ns= 4.50c
  783. Inst 809 AVX : VMOVHPS xmm, xmm, [m32] L: [memory dep.] T: 0.72ns= 1.50c
  784. Inst 810 AVX : VMOVHPS [m32], xmm L: [memory dep.] T: 1.43ns= 3.00c
  785. Inst 811 AVX : VMOVHPS LS pair L: 10.73ns= 22.5c T: 2.15ns= 4.50c
  786. Inst 812 SSE : MOVAPS xmm, xmm L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  787. Inst 813 SSE : MOVAPS xmm, [m128] L: [memory dep.] T: 0.36ns= 0.75c
  788. Inst 814 SSE : MOVAPS [m128], xmm L: [memory dep.] T: 0.76ns= 1.58c
  789. Inst 815 SSE : MOVAPS LS pair L: 7.87ns= 16.5c T: 0.48ns= 1.00c
  790. Inst 816 AVX : VMOVAPS xmm, xmm L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  791. Inst 817 AVX : VMOVAPS xmm, [m128] L: [memory dep.] T: 0.36ns= 0.75c
  792. Inst 818 AVX : VMOVAPS [m128], xmm L: [memory dep.] T: 0.72ns= 1.50c
  793. Inst 819 AVX : VMOVAPS LS pair L: 7.87ns= 16.5c T: 0.52ns= 1.08c
  794. Inst 820 SSE : MOVUPS xmm, xmm L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  795. Inst 821 SSE : MOVUPS xmm, [m128] L: [memory dep.] T: 0.36ns= 0.75c
  796. Inst 822 SSE : MOVUPS [m128], xmm L: [memory dep.] T: 0.72ns= 1.50c
  797. Inst 823 SSE : MOVUPS aligned LS pair L: 7.87ns= 16.5c T: 0.48ns= 1.00c
  798. Inst 824 SSE : MOVUPS xmm, [m128 + 4] L: [memory dep.] T: 0.72ns= 1.50c
  799. Inst 825 SSE : MOVUPS [m128 + 4], xmm L: [memory dep.] T: 1.43ns= 3.00c
  800. Inst 826 SSE : MOVUPS unaligned LS pair L: 8.59ns= 18.0c T: 0.72ns= 1.50c
  801. Inst 827 AVX : VMOVUPS xmm, xmm L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  802. Inst 828 AVX : VMOVUPS xmm, [m128] L: [memory dep.] T: 0.36ns= 0.75c
  803. Inst 829 AVX : VMOVUPS [m128], xmm L: [memory dep.] T: 0.76ns= 1.58c
  804. Inst 830 AVX : VMOVUPS aligned LS pair L: 7.87ns= 16.5c T: 0.52ns= 1.08c
  805. Inst 831 AVX : VMOVUPS xmm, [m128 + 4] L: [memory dep.] T: 0.72ns= 1.50c
  806. Inst 832 AVX : VMOVUPS [m128 + 4], xmm L: [memory dep.] T: 1.43ns= 3.00c
  807. Inst 833 AVX : VMOVUPS unaligned LS pair L: 8.59ns= 18.0c T: 0.91ns= 1.92c
  808. Inst 834 SSE4A : MOVNTSS [m32], xmm L: [memory dep.] T: 2.17ns= 2.17c
  809. Inst 835 SSE : MOVNTPS [m128], xmm L: [memory dep.] T: 2.17ns= 2.17c
  810. Inst 836 AVX : VMOVNTPS [m128], xmm L: [memory dep.] T: 2.17ns= 2.17c
  811. Inst 837 SSE : MOVMSKPS r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  812. Inst 838 AVX : VMOVMSKPS r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  813. Inst 839 AVX : VMASKMOVPS xmm,xmm,[m128+4] L: [memory dep.] T: 0.72ns= 1.50c
  814. Inst 840 AVX : VMASKMOVPS [m128+4],xmm,xmm L: [memory dep.] T: 5.73ns= 12.00c
  815. Inst 841 AVX : VMASKMOVPS unaligned LSpair L: 19.56ns= 41.0c T: 5.09ns= 10.67c
  816. Inst 842 SSE : UNPCKLPS xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  817. Inst 843 AVX : VUNPCKLPS xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  818. Inst 844 SSE : UNPCKHPS xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  819. Inst 845 AVX : VUNPCKHPS xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  820. Inst 846 SSE : SHUFPS xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  821. Inst 847 AVX : VSHUFPS xmm, xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  822. Inst 848 AVX : VPERMILPS xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  823. Inst 849 AVX : VPERMILPS xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  824. Inst 850 SSE : COMISS xmm, xmm L: [no true dep.] T: 0.72ns= 1.50c
  825. Inst 851 AVX : VCOMISS xmm, xmm L: [no true dep.] T: 0.72ns= 1.50c
  826. Inst 852 SSE : UCOMISS xmm, xmm L: [no true dep.] T: 0.72ns= 1.50c
  827. Inst 853 AVX : VUCOMISS xmm, xmm L: [no true dep.] T: 0.72ns= 1.50c
  828. Inst 854 SSE : CMPSS xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  829. Inst 855 SSE : CMPPS xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  830. Inst 856 AVX : VCMPSS xmm, xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  831. Inst 857 AVX : VCMPPS xmm, xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  832. Inst 858 SSE : SUBSS xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  833. Inst 859 AVX : VSUBSS xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  834. Inst 860 SSE : SUBPS xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  835. Inst 861 AVX : VSUBPS xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  836. Inst 862 SSE : ADDSS xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  837. Inst 863 AVX : VADDSS xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  838. Inst 864 SSE : ADDPS xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  839. Inst 865 AVX : VADDPS xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  840. Inst 866 SSE : MULSS xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  841. Inst 867 AVX : VMULSS xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  842. Inst 868 SSE : MULPS xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  843. Inst 869 AVX : VMULPS xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  844. Inst 870 SSE : MULSS+ADDSS xmm, xmm L: 7.16ns= 15.0c T: 0.72ns= 1.50c
  845. Inst 871 AVX : VMULSS+VADDSS xmm, xmm, xmm L: 7.16ns= 15.0c T: 0.72ns= 1.50c
  846. Inst 872 SSE : MULPS+ADDPS xmm, xmm L: 7.16ns= 15.0c T: 0.72ns= 1.50c
  847. Inst 873 AVX : VMULPS+VADDPS xmm, xmm, xmm L: 7.16ns= 15.0c T: 0.72ns= 1.50c
  848. Inst 874 SSE : MULSS xm1,xm1 ADDSS xm2,xm2 L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  849. Inst 875 AVX : VMULSS xmm1.. VADDSS xmm2.. L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  850. Inst 876 SSE : MULPS xm1,xm1 ADDPS xm2,xm2 L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  851. Inst 877 AVX : VMULPS xmm1.. VADDPS xmm2.. L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  852. Inst 878 SSE : MAXSS xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  853. Inst 879 AVX : VMAXSS xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  854. Inst 880 SSE : MAXPS xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  855. Inst 881 AVX : VMAXPS xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  856. Inst 882 SSE : MINSS xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  857. Inst 883 AVX : VMINSS xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  858. Inst 884 SSE : MINPS xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  859. Inst 885 AVX : VMINPS xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  860. Inst 886 SSE : ANDNPS xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  861. Inst 887 SSE : ANDNPS xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  862. Inst 888 AVX : VANDNPS xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  863. Inst 889 AVX : VANDNPS xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  864. Inst 890 SSE : ANDPS xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  865. Inst 891 SSE : ANDPS xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  866. Inst 892 AVX : VANDPS xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  867. Inst 893 AVX : VANDPS xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  868. Inst 894 SSE : ORPS xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  869. Inst 895 SSE : ORPS xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  870. Inst 896 AVX : VORPS xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  871. Inst 897 AVX : VORPS xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  872. Inst 898 SSE : XORPS xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  873. Inst 899 SSE : XORPS xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  874. Inst 900 AVX : VXORPS xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  875. Inst 901 AVX : VXORPS xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  876. Inst 902 SSE : DIVSS xmm, xmm L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  877. Inst 903 SSE : DIVSS (0.0f/x) L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  878. Inst 904 SSE : DIVSS (x/1.0f) L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  879. Inst 905 SSE : DIVSS (x/2.0f) L: 9.94ns= 20.8c T: 3.14ns= 6.58c
  880. Inst 906 SSE : DIVSS (x/0.5f) L: 9.94ns= 20.8c T: 3.14ns= 6.58c
  881. Inst 907 AVX : VDIVSS xmm, xmm, xmm L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  882. Inst 908 AVX : VDIVSS (0.0f/x) L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  883. Inst 909 AVX : VDIVSS (x/1.0f) L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  884. Inst 910 AVX : VDIVSS (x/2.0f) L: 9.90ns= 20.8c T: 3.14ns= 6.58c
  885. Inst 911 AVX : VDIVSS (x/0.5f) L: 9.90ns= 20.8c T: 3.14ns= 6.58c
  886. Inst 912 SSE : DIVPS xmm, xmm L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  887. Inst 913 SSE : DIVPS (0.0f/x) L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  888. Inst 914 SSE : DIVPS (x/1.0f) L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  889. Inst 915 SSE : DIVPS (x/2.0f) L: 8.07ns= 16.9c T: 3.14ns= 6.58c
  890. Inst 916 SSE : DIVPS (x/0.5f) L: 8.07ns= 16.9c T: 3.14ns= 6.58c
  891. Inst 917 AVX : VDIVPS xmm, xmm, xmm L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  892. Inst 918 AVX : VDIVPS (0.0f/x) L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  893. Inst 919 AVX : VDIVPS (x/1.0f) L: 10.02ns= 21.0c T: 3.22ns= 6.75c
  894. Inst 920 AVX : VDIVPS (x/2.0f) L: 8.07ns= 16.9c T: 3.14ns= 6.58c
  895. Inst 921 AVX : VDIVPS (x/0.5f) L: 8.07ns= 16.9c T: 3.14ns= 6.58c
  896. Inst 922 SSE : SQRTSS xmm, xmm L: 10.73ns= 22.5c T: 3.58ns= 7.50c
  897. Inst 923 SSE : SQRTSS (0.0f) L: 10.73ns= 22.5c T: 3.58ns= 7.50c
  898. Inst 924 SSE : SQRTSS (1.0f) L: 10.73ns= 22.5c T: 3.58ns= 7.50c
  899. Inst 925 AVX : VSQRTSS xmm, xmm L: 10.73ns= 22.5c T: 7.16ns= 15.00c
  900. Inst 926 AVX : VSQRTSS (0.0f) L: 10.73ns= 22.5c T: 7.16ns= 15.00c
  901. Inst 927 AVX : VSQRTSS (1.0f) L: 10.73ns= 22.5c T: 7.16ns= 15.00c
  902. Inst 928 SSE : SQRTPS xmm, xmm L: 10.73ns= 22.5c T: 3.58ns= 7.50c
  903. Inst 929 SSE : SQRTPS (0.0f) L: 10.73ns= 22.5c T: 3.58ns= 7.50c
  904. Inst 930 SSE : SQRTPS (1.0f) L: 10.73ns= 22.5c T: 3.58ns= 7.50c
  905. Inst 931 AVX : VSQRTPS xmm, xmm L: 10.73ns= 22.5c T: 3.58ns= 7.50c
  906. Inst 932 AVX : VSQRTPS (0.0f) L: 10.73ns= 22.5c T: 3.58ns= 7.50c
  907. Inst 933 AVX : VSQRTPS (1.0f) L: 10.73ns= 22.5c T: 3.58ns= 7.50c
  908. Inst 934 SSE : RCPSS xmm, xmm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  909. Inst 935 AVX : VRCPSS xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  910. Inst 936 SSE : RCPPS xmm, xmm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  911. Inst 937 AVX : VRCPPS xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  912. Inst 938 SSE : RSQRTSS xmm, xmm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  913. Inst 939 AVX : VRSQRTSS xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  914. Inst 940 SSE : RSQRTPS xmm, xmm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  915. Inst 941 AVX : VRSQRTPS xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  916. Inst 942 SSE : CVTPI2PS xmm, mm L: [diff. reg. set] T: 0.72ns= 1.50c
  917. Inst 943 SSE : CVTPS2PI mm, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  918. Inst 944 SSE : CVTPS2PI + CVTPI2PS L: 8.59ns= 18.0c T: 0.72ns= 1.50c
  919. Inst 945 SSE : CVTTPS2PI mm, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  920. Inst 946 SSE : CVTTPS2PI + CVTPI2PS L: 8.59ns= 18.0c T: 0.72ns= 1.50c
  921. Inst 947 SSE : CVTSI2SS xmm, r32 L: [diff. reg. set] T: 0.72ns= 1.50c
  922. Inst 948 SSE : CVTSS2SI r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  923. Inst 949 SSE : CVTSS2SI + CVTSI2SS r32 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  924. Inst 950 SSE : CVTTSS2SI r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  925. Inst 951 SSE : CVTTSS2SI + CVTSI2SS r32 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  926. Inst 952 AVX : VCVTSI2SS xmm, xmm, r32 L: [diff. reg. set] T: 0.72ns= 1.50c
  927. Inst 953 AVX : VCVTSS2SI r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  928. Inst 954 AVX : VCVTSS2SI + VCVTSI2SS r32 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  929. Inst 955 AVX : VCVTTSS2SI r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  930. Inst 956 AVX : VCVTTSS2SI + VCVTSI2SS r32 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  931. Inst 957 AMD64 : CVTSI2SS xmm, r64 L: [diff. reg. set] T: 0.72ns= 1.50c
  932. Inst 958 AMD64 : CVTSS2SI r64, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  933. Inst 959 AMD64 : CVTSS2SI + CVTSI2SS r64 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  934. Inst 960 AMD64 : CVTTSS2SI r64, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  935. Inst 961 AMD64 : CVTTSS2SI + CVTSI2SS r64 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  936. Inst 962 AVX : VCVTSI2SS xmm, xmm, r64 L: [diff. reg. set] T: 0.72ns= 1.50c
  937. Inst 963 AVX : VCVTSS2SI r64, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  938. Inst 964 AVX : VCVTSS2SI + VCVTSI2SS r64 L: 18.61ns= 39.0c T: 0.68ns= 1.42c
  939. Inst 965 AVX : VCVTTSS2SI r64, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  940. Inst 966 AVX : VCVTTSS2SI + VCVTSI2SS r64 L: 18.61ns= 39.0c T: 0.68ns= 1.42c
  941. Inst 967 SSE : STMXCSR [mem] L: [memory dep.] T: 10.73ns= 22.50c
  942. Inst 968 SSE : LDMXCSR [mem] L: [memory dep.] T: 156.92ns=328.92c
  943. Inst 969 SSE : STMXCSR + LDMXCSR L: 21.59ns= 45.3c T: 22.42ns= 47.00c
  944. Inst 970 SSE2 : MOVSD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  945. Inst 971 SSE2 : MOVSD xmm, [m64] L: [memory dep.] T: 0.36ns= 0.75c
  946. Inst 972 SSE2 : MOVSD [m64], xmm L: [memory dep.] T: 0.72ns= 1.50c
  947. Inst 973 SSE2 : MOVSD LS pair L: 7.87ns= 16.5c T: 0.48ns= 1.00c
  948. Inst 974 AVX : VMOVSD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  949. Inst 975 AVX : VMOVSD xmm, [m64] L: [memory dep.] T: 0.36ns= 0.75c
  950. Inst 976 AVX : VMOVSD [m64], xmm L: [memory dep.] T: 0.72ns= 1.50c
  951. Inst 977 AVX : VMOVSD LS pair L: 7.87ns= 16.5c T: 0.38ns= 0.80c
  952. Inst 978 SSE2 : MOVLPD xmm, [m64] L: [memory dep.] T: 0.36ns= 0.75c
  953. Inst 979 SSE2 : MOVLPD [m64], xmm L: [memory dep.] T: 0.72ns= 1.50c
  954. Inst 980 SSE2 : MOVLPD LS pair L: 9.30ns= 19.5c T: 0.48ns= 1.00c
  955. Inst 981 AVX : VMOVLPD xmm, [m64] L: [memory dep.] T: 0.36ns= 0.75c
  956. Inst 982 AVX : VMOVLPD [m64], xmm L: [memory dep.] T: 0.72ns= 1.50c
  957. Inst 983 AVX : VMOVLPD LS pair L: 9.30ns= 19.5c T: 0.18ns= 0.38c
  958. Inst 984 SSE2 : MOVHPD xmm, [m64] L: [memory dep.] T: 0.72ns= 1.50c
  959. Inst 985 SSE2 : MOVHPD [m64], xmm L: [memory dep.] T: 1.43ns= 3.00c
  960. Inst 986 SSE2 : MOVHPD LS pair L: 10.73ns= 22.5c T: 2.15ns= 4.50c
  961. Inst 987 AVX : VMOVHPD xmm, [m64] L: [memory dep.] T: 0.72ns= 1.50c
  962. Inst 988 AVX : VMOVHPD [m64], xmm L: [memory dep.] T: 1.43ns= 3.00c
  963. Inst 989 AVX : VMOVHPD LS pair L: 10.73ns= 22.5c T: 2.15ns= 4.50c
  964. Inst 990 SSE2 : MOVAPD xmm, xmm L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  965. Inst 991 SSE2 : MOVAPD xmm, [m128] L: [memory dep.] T: 0.36ns= 0.75c
  966. Inst 992 SSE2 : MOVAPD [m128], xmm L: [memory dep.] T: 0.72ns= 1.50c
  967. Inst 993 SSE2 : MOVAPD LS pair L: 7.87ns= 16.5c T: 0.48ns= 1.00c
  968. Inst 994 AVX : VMOVAPD xmm, xmm L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  969. Inst 995 AVX : VMOVAPD xmm, [m128] L: [memory dep.] T: 0.36ns= 0.75c
  970. Inst 996 AVX : VMOVAPD [m128], xmm L: [memory dep.] T: 0.72ns= 1.50c
  971. Inst 997 AVX : VMOVAPD LS pair L: 7.87ns= 16.5c T: 0.56ns= 1.17c
  972. Inst 998 SSE2 : MOVUPD xmm, xmm L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  973. Inst 999 SSE2 : MOVUPD xmm, [m128] L: [memory dep.] T: 0.36ns= 0.75c
  974. Inst 1000 SSE2 : MOVUPD [m128], xmm L: [memory dep.] T: 0.72ns= 1.50c
  975. Inst 1001 SSE2 : MOVUPD aligned LS pair L: 7.87ns= 16.5c T: 0.48ns= 1.00c
  976. Inst 1002 SSE2 : MOVUPD xmm, [m128 + 4] L: [memory dep.] T: 0.72ns= 1.50c
  977. Inst 1003 SSE2 : MOVUPD [m128 + 4], xmm L: [memory dep.] T: 1.43ns= 3.00c
  978. Inst 1004 SSE2 : MOVUPD unaligned LS pair L: 8.59ns= 18.0c T: 0.91ns= 1.92c
  979. Inst 1005 AVX : VMOVUPD xmm, xmm L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  980. Inst 1006 AVX : VMOVUPD xmm, [m128] L: [memory dep.] T: 0.36ns= 0.75c
  981. Inst 1007 AVX : VMOVUPD [m128], xmm L: [memory dep.] T: 0.72ns= 1.50c
  982. Inst 1008 AVX : VMOVUPD aligned LS pair L: 7.87ns= 16.5c T: 0.44ns= 0.92c
  983. Inst 1009 AVX : VMOVUPD xmm, [m128 + 4] L: [memory dep.] T: 0.72ns= 1.50c
  984. Inst 1010 AVX : VMOVUPD [m128 + 4], xmm L: [memory dep.] T: 1.43ns= 3.00c
  985. Inst 1011 AVX : VMOVUPD unaligned LS pair L: 8.59ns= 18.0c T: 0.72ns= 1.50c
  986. Inst 1012 SSE4A : MOVNTSD [m64], xmm L: [memory dep.] T: 2.17ns= 2.17c
  987. Inst 1013 SSE2 : MOVNTPD [m128], xmm L: [memory dep.] T: 2.17ns= 2.17c
  988. Inst 1014 AVX : VMOVNTPD [m128], xmm L: [memory dep.] T: 2.17ns= 2.17c
  989. Inst 1015 SSE2 : MOVMSKPD r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  990. Inst 1016 AVX : VMOVMSKPD r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  991. Inst 1017 AVX : VMASKMOVPD xmm,xmm,[m128+4] L: [memory dep.] T: 0.72ns= 1.50c
  992. Inst 1018 AVX : VMASKMOVPD [m128+4],xmm,xmm L: [memory dep.] T: 5.73ns= 12.00c
  993. Inst 1019 AVX : VMASKMOVPD unaligned LSpair L: 19.56ns= 41.0c T: 5.73ns= 12.00c
  994. Inst 1020 SSE2 : UNPCKLPD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  995. Inst 1021 AVX : VUNPCKLPD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  996. Inst 1022 SSE2 : UNPCKHPD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  997. Inst 1023 AVX : VUNPCKHPD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  998. Inst 1024 SSE2 : SHUFPD xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  999. Inst 1025 AVX : VSHUFPD xmm, xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1000. Inst 1026 AVX : VPERMILPD xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1001. Inst 1027 AVX : VPERMILPD xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1002. Inst 1028 SSE2 : COMISD xmm, xmm L: [no true dep.] T: 0.72ns= 1.50c
  1003. Inst 1029 AVX : VCOMISD xmm, xmm L: [no true dep.] T: 0.72ns= 1.50c
  1004. Inst 1030 SSE2 : UCOMISD xmm, xmm L: [no true dep.] T: 0.72ns= 1.50c
  1005. Inst 1031 AVX : VUCOMISD xmm, xmm L: [no true dep.] T: 0.72ns= 1.50c
  1006. Inst 1032 SSE2 : CMPSD xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1007. Inst 1033 SSE2 : CMPPD xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1008. Inst 1034 AVX : VCMPSD xmm, xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1009. Inst 1035 AVX : VCMPPD xmm, xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1010. Inst 1036 SSE2 : SUBSD xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1011. Inst 1037 AVX : VSUBSD xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1012. Inst 1038 SSE2 : SUBPD xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1013. Inst 1039 AVX : VSUBPD xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1014. Inst 1040 SSE2 : ADDSD xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  1015. Inst 1041 AVX : VADDSD xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  1016. Inst 1042 SSE2 : ADDPD xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  1017. Inst 1043 AVX : VADDPD xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  1018. Inst 1044 SSE2 : MULSD xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  1019. Inst 1045 AVX : VMULSD xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  1020. Inst 1046 SSE2 : MULPD xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  1021. Inst 1047 AVX : VMULPD xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.36ns= 0.75c
  1022. Inst 1048 SSE2 : MULSD+ADDSD xmm, xmm L: 7.16ns= 15.0c T: 0.72ns= 1.50c
  1023. Inst 1049 AVX : VMULSD+VADDSD xmm, xmm, xmm L: 7.16ns= 15.0c T: 0.72ns= 1.50c
  1024. Inst 1050 SSE2 : MULPD+ADDPD xmm, xmm L: 7.16ns= 15.0c T: 0.72ns= 1.50c
  1025. Inst 1051 AVX : VMULPD+VADDPD xmm, xmm, xmm L: 7.16ns= 15.0c T: 0.72ns= 1.50c
  1026. Inst 1052 SSE2 : MULSD xm1,xm1 ADDSD xm2,xm2 L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1027. Inst 1053 AVX : VMULSD xmm1.. VADDSD xmm2.. L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1028. Inst 1054 SSE2 : MULPD xm1,xm1 ADDPD xm2,xm2 L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1029. Inst 1055 AVX : VMULPD xmm1.. VADDPD xmm2.. L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1030. Inst 1056 SSE2 : MAXSD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1031. Inst 1057 AVX : VMAXSD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1032. Inst 1058 SSE2 : MAXPD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1033. Inst 1059 AVX : VMAXPD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1034. Inst 1060 SSE2 : MINSD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1035. Inst 1061 AVX : VMINSD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1036. Inst 1062 SSE2 : MINPD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1037. Inst 1063 AVX : VMINPD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1038. Inst 1064 SSE2 : ANDNPD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1039. Inst 1065 SSE2 : ANDNPD xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1040. Inst 1066 AVX : VANDNPD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1041. Inst 1067 AVX : VANDNPD xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1042. Inst 1068 SSE2 : ANDPD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1043. Inst 1069 SSE2 : ANDPD xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1044. Inst 1070 AVX : VANDPD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1045. Inst 1071 AVX : VANDPD xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1046. Inst 1072 SSE2 : ORPD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1047. Inst 1073 SSE2 : ORPD xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1048. Inst 1074 AVX : VORPD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1049. Inst 1075 AVX : VORPD xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1050. Inst 1076 SSE2 : XORPD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1051. Inst 1077 SSE2 : XORPD xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1052. Inst 1078 AVX : VXORPD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1053. Inst 1079 AVX : VXORPD xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1054. Inst 1080 SSE2 : DIVSD xmm, xmm L: 17.18ns= 36.0c T: 6.80ns= 14.25c
  1055. Inst 1081 SSE2 : DIVSD (0.0/x) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1056. Inst 1082 SSE2 : DIVSD (x/1.0) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1057. Inst 1083 SSE2 : DIVSD (x/2.0) L: 6.36ns= 13.3c T: 3.22ns= 6.75c
  1058. Inst 1084 SSE2 : DIVSD (x/0.5) L: 6.36ns= 13.3c T: 3.22ns= 6.75c
  1059. Inst 1085 AVX : VDIVSD xmm, xmm, xmm L: 17.18ns= 36.0c T: 6.80ns= 14.25c
  1060. Inst 1086 AVX : VDIVSD (0.0/x) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1061. Inst 1087 AVX : VDIVSD (x/1.0) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1062. Inst 1088 AVX : VDIVSD (x/2.0) L: 6.32ns= 13.3c T: 3.22ns= 6.75c
  1063. Inst 1089 AVX : VDIVSD (x/0.5) L: 6.32ns= 13.3c T: 3.22ns= 6.75c
  1064. Inst 1090 SSE2 : DIVPD xmm, xmm L: 17.18ns= 36.0c T: 6.80ns= 14.25c
  1065. Inst 1091 SSE2 : DIVPD (0.0/x) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1066. Inst 1092 SSE2 : DIVPD (x/1.0) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1067. Inst 1093 SSE2 : DIVPD (x/2.0) L: 5.17ns= 10.8c T: 3.22ns= 6.75c
  1068. Inst 1094 SSE2 : DIVPD (x/0.5) L: 5.17ns= 10.8c T: 3.22ns= 6.75c
  1069. Inst 1095 AVX : VDIVPD xmm, xmm, xmm L: 17.18ns= 36.0c T: 6.80ns= 14.25c
  1070. Inst 1096 AVX : VDIVPD (0.0/x) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1071. Inst 1097 AVX : VDIVPD (x/1.0) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1072. Inst 1098 AVX : VDIVPD (x/2.0) L: 5.17ns= 10.8c T: 3.22ns= 6.75c
  1073. Inst 1099 AVX : VDIVPD (x/0.5) L: 5.17ns= 10.8c T: 3.22ns= 6.75c
  1074. Inst 1100 SSE2 : SQRTSD xmm, xmm L: 17.89ns= 37.5c T: 7.16ns= 15.00c
  1075. Inst 1101 SSE2 : SQRTSD (0.0) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1076. Inst 1102 SSE2 : SQRTSD (1.0) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1077. Inst 1103 AVX : VSQRTSD xmm, xmm L: 17.89ns= 37.5c T: 14.31ns= 30.00c
  1078. Inst 1104 AVX : VSQRTSD (0.0) L: 6.44ns= 13.5c T: 6.44ns= 13.50c
  1079. Inst 1105 AVX : VSQRTSD (1.0) L: 6.44ns= 13.5c T: 6.44ns= 13.50c
  1080. Inst 1106 SSE2 : SQRTPD xmm, xmm L: 17.89ns= 37.5c T: 7.16ns= 15.00c
  1081. Inst 1107 SSE2 : SQRTPD (0.0) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1082. Inst 1108 SSE2 : SQRTPD (1.0) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1083. Inst 1109 AVX : VSQRTPD xmm, xmm L: 17.89ns= 37.5c T: 7.16ns= 15.00c
  1084. Inst 1110 AVX : VSQRTPD (0.0) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1085. Inst 1111 AVX : VSQRTPD (1.0) L: 6.44ns= 13.5c T: 3.22ns= 6.75c
  1086. Inst 1112 SSE2 : CVTPI2PD xmm, mm L: [diff. reg. set] T: 0.72ns= 1.50c
  1087. Inst 1113 SSE2 : CVTPD2PI mm, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1088. Inst 1114 SSE2 : CVTPD2PI + CVTPI2PD L: 10.02ns= 21.0c T: 0.52ns= 1.08c
  1089. Inst 1115 SSE2 : CVTTPD2PI mm, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1090. Inst 1116 SSE2 : CVTTPD2PI + CVTPI2PD L: 10.02ns= 21.0c T: 0.72ns= 1.50c
  1091. Inst 1117 SSE2 : CVTSI2SD xmm, r32 L: [diff. reg. set] T: 0.72ns= 1.50c
  1092. Inst 1118 SSE2 : CVTSD2SI r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1093. Inst 1119 SSE2 : CVTSD2SI + CVTSI2SD r32 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  1094. Inst 1120 SSE2 : CVTTSD2SI r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1095. Inst 1121 SSE2 : CVTTSD2SI + CVTSI2SD r32 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  1096. Inst 1122 AVX : VCVTSI2SD xmm, xmm, r32 L: [diff. reg. set] T: 0.72ns= 1.50c
  1097. Inst 1123 AVX : VCVTSD2SI r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1098. Inst 1124 AVX : VCVTSD2SI + VCVTSI2SD r32 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  1099. Inst 1125 AVX : VCVTTSD2SI r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1100. Inst 1126 AVX : VCVTTSD2SI + VCVTSI2SD r32 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  1101. Inst 1127 AMD64 : CVTSI2SD xmm, r64 L: [diff. reg. set] T: 0.72ns= 1.50c
  1102. Inst 1128 AMD64 : CVTSD2SI r64, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1103. Inst 1129 AMD64 : CVTSD2SI + CVTSI2SD r64 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  1104. Inst 1130 AMD64 : CVTTSD2SI r64, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1105. Inst 1131 AMD64 : CVTTSD2SI + CVTSI2SD r64 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  1106. Inst 1132 AVX : VCVTSI2SD xmm, xmm, r64 L: [diff. reg. set] T: 0.72ns= 1.50c
  1107. Inst 1133 AVX : VCVTSD2SI r64, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1108. Inst 1134 AVX : VCVTSD2SI + VCVTSI2SD r64 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  1109. Inst 1135 AVX : VCVTTSD2SI r64, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1110. Inst 1136 AVX : VCVTTSD2SI + VCVTSI2SD r64 L: 18.61ns= 39.0c T: 0.72ns= 1.50c
  1111. Inst 1137 SSE2 : CVTDQ2PD xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1112. Inst 1138 SSE2 : CVTPD2DQ xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1113. Inst 1139 SSE2 : CVTPD2DQ + CVTDQ2PD L: 8.59ns= 18.0c T: 0.72ns= 1.50c
  1114. Inst 1140 SSE2 : CVTTPD2DQ xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1115. Inst 1141 SSE2 : CVTTPD2DQ + CVTDQ2PD L: 8.59ns= 18.0c T: 0.72ns= 1.50c
  1116. Inst 1142 AVX : VCVTDQ2PD xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1117. Inst 1143 AVX : VCVTPD2DQ xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1118. Inst 1144 AVX : VCVTPD2DQ + VCVTDQ2PD L: 8.59ns= 18.0c T: 0.72ns= 1.50c
  1119. Inst 1145 AVX : VCVTTPD2DQ xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1120. Inst 1146 AVX : VCVTTPD2DQ + VCVTDQ2PD L: 8.59ns= 18.0c T: 0.72ns= 1.50c
  1121. Inst 1147 SSE2 : CVTDQ2PS xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1122. Inst 1148 SSE2 : CVTPS2DQ xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1123. Inst 1149 SSE2 : CVTPS2DQ + CVTDQ2PS L: 5.73ns= 12.0c T: 1.43ns= 3.00c
  1124. Inst 1150 SSE2 : CVTTPS2DQ xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1125. Inst 1151 SSE2 : CVTTPS2DQ + CVTDQ2PS L: 5.73ns= 12.0c T: 1.43ns= 3.00c
  1126. Inst 1152 AVX : VCVTDQ2PS xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1127. Inst 1153 AVX : VCVTPS2DQ xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1128. Inst 1154 AVX : VCVTPS2DQ + VCVTDQ2PS L: 5.73ns= 12.0c T: 1.43ns= 3.00c
  1129. Inst 1155 AVX : VCVTTPS2DQ xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1130. Inst 1156 AVX : VCVTTPS2DQ + VCVTDQ2PS L: 5.73ns= 12.0c T: 1.43ns= 3.00c
  1131. Inst 1157 SSE2 : CVTPS2PD xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1132. Inst 1158 SSE2 : CVTPD2PS xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1133. Inst 1159 SSE2 : CVTPD2PS + CVTPS2PD L: 8.59ns= 18.0c T: 0.72ns= 1.50c
  1134. Inst 1160 SSE2 : CVTSS2SD xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1135. Inst 1161 SSE2 : CVTSD2SS xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1136. Inst 1162 SSE2 : CVTSD2SS + CVTSS2SD L: 5.73ns= 12.0c T: 1.43ns= 3.00c
  1137. Inst 1163 AVX : VCVTPS2PD xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1138. Inst 1164 AVX : VCVTPD2PS xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1139. Inst 1165 AVX : VCVTPD2PS + VCVTPS2PD L: 8.59ns= 18.0c T: 0.68ns= 1.42c
  1140. Inst 1166 AVX : VCVTSS2SD xmm, xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1141. Inst 1167 AVX : VCVTSD2SS xmm, xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1142. Inst 1168 AVX : VCVTSD2SS + VCVTSS2SD L: 5.73ns= 12.0c T: 1.43ns= 3.00c
  1143. Inst 1169 SSE2 : MOVD r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1144. Inst 1170 SSE2 : MOVD xmm, r32 L: [diff. reg. set] T: 0.72ns= 1.50c
  1145. Inst 1171 SSE2 : MOVD r32, xmm+MOVD xmm, r32 L: 12.88ns= 27.0c T: 0.48ns= 1.00c
  1146. Inst 1172 AVX : VMOVD r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1147. Inst 1173 AVX : VMOVD xmm, r32 L: [diff. reg. set] T: 0.72ns= 1.50c
  1148. Inst 1174 AVX : VMOVD r32,xmm+VMOVD xmm,r32 L: 12.88ns= 27.0c T: 0.48ns= 1.00c
  1149. Inst 1175 AMD64 : MOVD r64, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1150. Inst 1176 AMD64 : MOVD xmm, r64 L: [diff. reg. set] T: 0.72ns= 1.50c
  1151. Inst 1177 AMD64 : MOVD r64, xmm+MOVD xmm, r64 L: 12.88ns= 27.0c T: 0.48ns= 1.00c
  1152. Inst 1178 AVX : VMOVD r64, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1153. Inst 1179 AVX : VMOVD xmm, r64 L: [diff. reg. set] T: 0.72ns= 1.50c
  1154. Inst 1180 AVX : VMOVD r64,xmm+VMOVD xmm,r64 L: 12.88ns= 27.0c T: 0.48ns= 1.00c
  1155. Inst 1181 SSE2 : MOVD xmm, [m32] L: [memory dep.] T: 0.36ns= 0.75c
  1156. Inst 1182 SSE2 : MOVD [m32], xmm L: [memory dep.] T: 0.72ns= 1.50c
  1157. Inst 1183 SSE2 : MOVD LS pair L: 7.87ns= 16.5c T: 0.48ns= 1.00c
  1158. Inst 1184 AVX : VMOVD xmm, [m32] L: [memory dep.] T: 0.36ns= 0.75c
  1159. Inst 1185 AVX : VMOVD [m32], xmm L: [memory dep.] T: 0.72ns= 1.50c
  1160. Inst 1186 AVX : VMOVD LS pair L: 7.87ns= 16.5c T: 1.55ns= 3.25c
  1161. Inst 1187 SSE2 : MOVQ xmm, [m64] L: [memory dep.] T: 0.36ns= 0.75c
  1162. Inst 1188 SSE2 : MOVQ [m64], xmm L: [memory dep.] T: 0.72ns= 1.50c
  1163. Inst 1189 SSE2 : MOVQ LS pair L: 7.87ns= 16.5c T: 0.48ns= 1.00c
  1164. Inst 1190 AVX : VMOVQ xmm, [m64] L: [memory dep.] T: 0.36ns= 0.75c
  1165. Inst 1191 AVX : VMOVQ [m64], xmm L: [memory dep.] T: 0.72ns= 1.50c
  1166. Inst 1192 AVX : VMOVQ LS pair L: 7.87ns= 16.5c T: 0.64ns= 1.33c
  1167. Inst 1193 SSE2 : MOVDQ2Q mm, xmm L: [diff. reg. set] T: 0.36ns= 0.75c
  1168. Inst 1194 SSE2 : MOVQ2DQ xmm, mm L: [diff. reg. set] T: 0.36ns= 0.75c
  1169. Inst 1195 SSE2 : MOVDQ2Q + MOVQ2DQ xmm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1170. Inst 1196 SSE2 : MOVDQA xmm, xmm L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  1171. Inst 1197 SSE2 : MOVDQA xmm, [m128] L: [memory dep.] T: 0.36ns= 0.75c
  1172. Inst 1198 SSE2 : MOVDQA [m128], xmm L: [memory dep.] T: 0.72ns= 1.50c
  1173. Inst 1199 SSE2 : MOVDQA LS pair L: 7.87ns= 16.5c T: 0.48ns= 1.00c
  1174. Inst 1200 AVX : VMOVDQA xmm, xmm L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  1175. Inst 1201 AVX : VMOVDQA xmm, [m128] L: [memory dep.] T: 0.36ns= 0.75c
  1176. Inst 1202 AVX : VMOVDQA [m128], xmm L: [memory dep.] T: 0.72ns= 1.50c
  1177. Inst 1203 AVX : VMOVDQA LS pair L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  1178. Inst 1204 SSE2 : MOVDQU xmm, xmm L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  1179. Inst 1205 SSE2 : MOVDQU xmm, [m128] L: [memory dep.] T: 0.36ns= 0.75c
  1180. Inst 1206 SSE2 : MOVDQU [m128], xmm L: [memory dep.] T: 0.72ns= 1.50c
  1181. Inst 1207 SSE2 : MOVDQU aligned LS pair L: 7.87ns= 16.5c T: 0.48ns= 1.00c
  1182. Inst 1208 SSE2 : MOVDQU xmm, [m128 + 4] L: [memory dep.] T: 0.72ns= 1.50c
  1183. Inst 1209 SSE2 : MOVDQU [m128 + 4], xmm L: [memory dep.] T: 1.43ns= 3.00c
  1184. Inst 1210 SSE2 : MOVDQU unaligned LS pair L: 8.59ns= 18.0c T: 0.91ns= 1.92c
  1185. Inst 1211 AVX : VMOVDQU xmm, xmm L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  1186. Inst 1212 AVX : VMOVDQU xmm, [m128] L: [memory dep.] T: 0.36ns= 0.75c
  1187. Inst 1213 AVX : VMOVDQU [m128], xmm L: [memory dep.] T: 0.72ns= 1.50c
  1188. Inst 1214 AVX : VMOVDQU aligned LS pair L: 7.87ns= 16.5c T: 0.56ns= 1.17c
  1189. Inst 1215 AVX : VMOVDQU xmm, [m128 + 4] L: [memory dep.] T: 0.72ns= 1.50c
  1190. Inst 1216 AVX : VMOVDQU [m128 + 4], xmm L: [memory dep.] T: 1.43ns= 3.00c
  1191. Inst 1217 AVX : VMOVDQU unaligned LS pair L: 8.59ns= 18.0c T: 1.11ns= 2.33c
  1192. Inst 1218 SSE4.1: MOVNTDQA xmm, [m128] L: [memory dep.] T: 0.75ns= 0.75c
  1193. Inst 1219 SSE2 : MOVNTDQ [m128], xmm L: [memory dep.] T: 2.17ns= 2.17c
  1194. Inst 1220 SSE4.1: MOVNTDQA + MOVNTDQ L: 7.87ns= 16.5c T: 16.50ns= 16.50c
  1195. Inst 1221 AVX : VMOVNTDQA xmm, [m128] L: [memory dep.] T: 0.75ns= 0.75c
  1196. Inst 1222 AVX : VMOVNTDQ [m128], xmm L: [memory dep.] T: 2.17ns= 2.17c
  1197. Inst 1223 AVX : VMOVNTDQA + VMOVNTDQ L: 7.87ns= 16.5c T: 16.50ns= 16.50c
  1198. Inst 1224 SSE2 : PMOVMSKB r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1199. Inst 1225 AMD64 : PMOVMSKB r64, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1200. Inst 1226 AVX : VPMOVMSKB r32, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1201. Inst 1227 AVX : VPMOVMSKB r64, xmm L: [diff. reg. set] T: 0.72ns= 1.50c
  1202. Inst 1228 SSE2 : MASKMOVDQU xmm, xmm L: [memory dep.] T: 50.00ns= 50.00c
  1203. Inst 1229 AVX : VMASKMOVDQU xmm, xmm L: [memory dep.] T: 49.50ns= 49.50c
  1204. Inst 1230 SSE2 : PADDB xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1205. Inst 1231 AVX : VPADDB xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1206. Inst 1232 SSE2 : PADDW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1207. Inst 1233 AVX : VPADDW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1208. Inst 1234 SSE2 : PADDD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1209. Inst 1235 AVX : VPADDD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1210. Inst 1236 SSE2 : PADDQ xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1211. Inst 1237 AVX : VPADDQ xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1212. Inst 1238 SSE2 : PADDSB xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1213. Inst 1239 AVX : VPADDSB xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1214. Inst 1240 SSE2 : PADDSW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1215. Inst 1241 AVX : VPADDSW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1216. Inst 1242 SSE2 : PADDUSB xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1217. Inst 1243 AVX : VPADDUSB xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1218. Inst 1244 SSE2 : PADDUSW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1219. Inst 1245 AVX : VPADDUSW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1220. Inst 1246 SSE2 : PSUBB xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1221. Inst 1247 SSE2 : PSUBB xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1222. Inst 1248 AVX : VPSUBB xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1223. Inst 1249 AVX : VPSUBB xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1224. Inst 1250 SSE2 : PSUBW xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1225. Inst 1251 SSE2 : PSUBW xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1226. Inst 1252 AVX : VPSUBW xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1227. Inst 1253 AVX : VPSUBW xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1228. Inst 1254 SSE2 : PSUBD xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1229. Inst 1255 SSE2 : PSUBD xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1230. Inst 1256 AVX : VPSUBD xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1231. Inst 1257 AVX : VPSUBD xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1232. Inst 1258 SSE2 : PSUBQ xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1233. Inst 1259 SSE2 : PSUBQ xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1234. Inst 1260 AVX : VPSUBQ xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1235. Inst 1261 AVX : VPSUBQ xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1236. Inst 1262 SSE2 : PSUBSB xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1237. Inst 1263 SSE2 : PSUBSB xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1238. Inst 1264 AVX : VPSUBSB xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1239. Inst 1265 AVX : VPSUBSB xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1240. Inst 1266 SSE2 : PSUBSW xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1241. Inst 1267 SSE2 : PSUBSW xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1242. Inst 1268 AVX : VPSUBSW xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1243. Inst 1269 AVX : VPSUBSW xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1244. Inst 1270 SSE2 : PSUBUSB xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1245. Inst 1271 SSE2 : PSUBUSB xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1246. Inst 1272 AVX : VPSUBUSB xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1247. Inst 1273 AVX : VPSUBUSB xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1248. Inst 1274 SSE2 : PSUBUSW xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1249. Inst 1275 SSE2 : PSUBUSW xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1250. Inst 1276 AVX : VPSUBUSW xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1251. Inst 1277 AVX : VPSUBUSW xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1252. Inst 1278 SSE2 : PCMPEQB xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1253. Inst 1279 SSE2 : PCMPEQB xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1254. Inst 1280 AVX : VPCMPEQB xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1255. Inst 1281 AVX : VPCMPEQB xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1256. Inst 1282 SSE2 : PCMPEQW xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1257. Inst 1283 SSE2 : PCMPEQW xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1258. Inst 1284 AVX : VPCMPEQW xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1259. Inst 1285 AVX : VPCMPEQW xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1260. Inst 1286 SSE2 : PCMPEQD xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1261. Inst 1287 SSE2 : PCMPEQD xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1262. Inst 1288 AVX : VPCMPEQD xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1263. Inst 1289 AVX : VPCMPEQD xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1264. Inst 1290 SSE4.1: PCMPEQQ xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1265. Inst 1291 SSE4.1: PCMPEQQ xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1266. Inst 1292 AVX : VPCMPEQQ xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1267. Inst 1293 AVX : VPCMPEQQ xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1268. Inst 1294 SSE2 : PCMPGTB xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1269. Inst 1295 SSE2 : PCMPGTB xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1270. Inst 1296 AVX : VPCMPGTB xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1271. Inst 1297 AVX : VPCMPGTB xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1272. Inst 1298 SSE2 : PCMPGTW xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1273. Inst 1299 SSE2 : PCMPGTW xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1274. Inst 1300 AVX : VPCMPGTW xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1275. Inst 1301 AVX : VPCMPGTW xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1276. Inst 1302 SSE2 : PCMPGTD xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1277. Inst 1303 SSE2 : PCMPGTD xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1278. Inst 1304 AVX : VPCMPGTD xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1279. Inst 1305 AVX : VPCMPGTD xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1280. Inst 1306 SSE4.2: PCMPGTQ xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1281. Inst 1307 SSE4.2: PCMPGTQ xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1282. Inst 1308 AVX : VPCMPGTQ xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1283. Inst 1309 AVX : VPCMPGTQ xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1284. Inst 1310 SSE2 : PAND xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1285. Inst 1311 SSE2 : PAND xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1286. Inst 1312 AVX : VPAND xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1287. Inst 1313 AVX : VPAND xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1288. Inst 1314 SSE2 : PANDN xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1289. Inst 1315 SSE2 : PANDN xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1290. Inst 1316 AVX : VPANDN xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1291. Inst 1317 AVX : VPANDN xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1292. Inst 1318 SSE2 : POR xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1293. Inst 1319 SSE2 : POR xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1294. Inst 1320 AVX : VPOR xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1295. Inst 1321 AVX : VPOR xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1296. Inst 1322 SSE2 : PXOR xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1297. Inst 1323 SSE2 : PXOR xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1298. Inst 1324 AVX : VPXOR xmm, xmm, xmm L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1299. Inst 1325 AVX : VPXOR xmm1, xmm1, xmm2 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1300. Inst 1326 SSE2 : PMULHW xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1301. Inst 1327 AVX : VPMULHW xmm, xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1302. Inst 1328 SSE2 : PMULHUW xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1303. Inst 1329 AVX : VPMULHUW xmm, xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1304. Inst 1330 SSSE3 : PMULHRSW xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1305. Inst 1331 AVX : VPMULHRSW xmm, xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1306. Inst 1332 SSE2 : PMULLW xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1307. Inst 1333 AVX : VPMULLW xmm, xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1308. Inst 1334 SSE4.1: PMULLD xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1309. Inst 1335 AVX : VPMULLD xmm, xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1310. Inst 1336 SSE4.1: PMULDQ xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1311. Inst 1337 AVX : VPMULDQ xmm, xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1312. Inst 1338 SSE2 : PMULUDQ xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1313. Inst 1339 AVX : VPMULUDQ xmm, xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1314. Inst 1340 SSSE3 : PMADDUBSW xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1315. Inst 1341 AVX : VPMADDUBSW xmm, xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1316. Inst 1342 SSE2 : PMADDWD xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1317. Inst 1343 AVX : VPMADDWD xmm, xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1318. Inst 1344 SSE2 : PSLLW xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1319. Inst 1345 AVX : VPSLLW xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1320. Inst 1346 SSE2 : PSLLW xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1321. Inst 1347 AVX : VPSLLW xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1322. Inst 1348 SSE2 : PSLLD xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1323. Inst 1349 AVX : VPSLLD xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1324. Inst 1350 SSE2 : PSLLD xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1325. Inst 1351 AVX : VPSLLD xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1326. Inst 1352 SSE2 : PSLLQ xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1327. Inst 1353 AVX : VPSLLQ xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1328. Inst 1354 SSE2 : PSLLQ xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1329. Inst 1355 AVX : VPSLLQ xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1330. Inst 1356 SSE2 : PSLLDQ xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1331. Inst 1357 AVX : VPSLLDQ xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1332. Inst 1358 SSE2 : PSRAW xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1333. Inst 1359 AVX : VPSRAW xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1334. Inst 1360 SSE2 : PSRAW xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1335. Inst 1361 AVX : VPSRAW xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1336. Inst 1362 SSE2 : PSRAD xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1337. Inst 1363 AVX : VPSRAD xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1338. Inst 1364 SSE2 : PSRAD xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1339. Inst 1365 AVX : VPSRAD xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1340. Inst 1366 SSE2 : PSRLW xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1341. Inst 1367 AVX : VPSRLW xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1342. Inst 1368 SSE2 : PSRLW xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1343. Inst 1369 AVX : VPSRLW xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1344. Inst 1370 SSE2 : PSRLD xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1345. Inst 1371 AVX : VPSRLD xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1346. Inst 1372 SSE2 : PSRLD xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1347. Inst 1373 AVX : VPSRLD xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1348. Inst 1374 SSE2 : PSRLQ xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1349. Inst 1375 AVX : VPSRLQ xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1350. Inst 1376 SSE2 : PSRLQ xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1351. Inst 1377 AVX : VPSRLQ xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1352. Inst 1378 SSE2 : PSRLDQ xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1353. Inst 1379 AVX : VPSRLDQ xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1354. Inst 1380 SSE2 : PUNPCKHBW xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1355. Inst 1381 AVX : VPUNPCKHBW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1356. Inst 1382 SSE2 : PUNPCKHWD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1357. Inst 1383 AVX : VPUNPCKHWD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1358. Inst 1384 SSE2 : PUNPCKHDQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1359. Inst 1385 AVX : VPUNPCKHDQ xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1360. Inst 1386 SSE2 : PUNPCKHQDQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1361. Inst 1387 AVX : VPUNPCKHQDQ xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1362. Inst 1388 SSE2 : PUNPCKLBW xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1363. Inst 1389 AVX : VPUNPCKLBW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1364. Inst 1390 SSE2 : PUNPCKLWD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1365. Inst 1391 AVX : VPUNPCKLWD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1366. Inst 1392 SSE2 : PUNPCKLDQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1367. Inst 1393 AVX : VPUNPCKLDQ xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1368. Inst 1394 SSE2 : PUNPCKLQDQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1369. Inst 1395 AVX : VPUNPCKLQDQ xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1370. Inst 1396 SSE2 : PACKSSWB xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1371. Inst 1397 AVX : VPACKSSWB xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1372. Inst 1398 SSE2 : PACKUSWB xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1373. Inst 1399 AVX : VPACKUSWB xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1374. Inst 1400 SSE2 : PACKSSDW xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1375. Inst 1401 AVX : VPACKSSDW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1376. Inst 1402 SSE4.1: PACKUSDW xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1377. Inst 1403 AVX : VPACKUSDW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1378. Inst 1404 SSE2 : PAVGB xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1379. Inst 1405 AVX : VPAVGB xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1380. Inst 1406 SSE2 : PAVGW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1381. Inst 1407 AVX : VPAVGW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1382. Inst 1408 SSE4.1: PEXTRB r32, xmm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1383. Inst 1409 SSE4.1: PINSRB xmm, r32, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1384. Inst 1410 SSE4.1: PEXTRB + PINSRB r32 L: 15.74ns= 33.0c T: 0.56ns= 1.17c
  1385. Inst 1411 AVX : VPEXTRB r32, xmm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1386. Inst 1412 AVX : VPINSRB xmm, r32, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1387. Inst 1413 AVX : VPEXTRB + VPINSRB r32 L: 15.74ns= 33.0c T: 0.48ns= 1.00c
  1388. Inst 1414 SSE4.1: PEXTRB r64, xmm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1389. Inst 1415 SSE4.1: PEXTRB r64 + PINSRB r32 L: 15.74ns= 33.0c T: 0.32ns= 0.67c
  1390. Inst 1416 AVX : VPEXTRB r64, xmm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1391. Inst 1417 AVX : VPEXTRB r64 + VPINSRB r32 L: 15.74ns= 33.0c T: 0.26ns= 0.54c
  1392. Inst 1418 SSE2 : PEXTRW r32, xmm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1393. Inst 1419 SSE2 : PINSRW xmm, r32, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1394. Inst 1420 SSE2 : PEXTRW + PINSRW r32 L: 15.74ns= 33.0c T: 0.16ns= 0.34c
  1395. Inst 1421 AVX : VPEXTRW r32, xmm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1396. Inst 1422 AVX : VPINSRW xmm, r32, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1397. Inst 1423 AVX : VPEXTRW + VPINSRW r32 L: 15.74ns= 33.0c T: 0.56ns= 1.17c
  1398. Inst 1424 AMD64 : PEXTRW r64, xmm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1399. Inst 1425 AMD64 : PEXTRW r64 + PINSRW r32 L: 15.74ns= 33.0c T: 0.36ns= 0.76c
  1400. Inst 1426 AVX : VPEXTRW r64, xmm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1401. Inst 1427 AVX : VPEXTRW r64 + VPINSRW r32 L: 15.74ns= 33.0c T: 0.72ns= 1.50c
  1402. Inst 1428 SSE4.1: PEXTRD r32, xmm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1403. Inst 1429 SSE4.1: PINSRD xmm, r32, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1404. Inst 1430 SSE4.1: PEXTRD + PINSRD r32 L: 15.74ns= 33.0c T: 0.36ns= 0.75c
  1405. Inst 1431 AVX : VPEXTRD r32, xmm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1406. Inst 1432 AVX : VPINSRD xmm, r32, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1407. Inst 1433 AVX : VPEXTRD + VPINSRD r32 L: 15.74ns= 33.0c T: 1.23ns= 2.58c
  1408. Inst 1434 SSE4.1: PEXTRQ r64, xmm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1409. Inst 1435 SSE4.1: PINSRQ xmm, r64, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1410. Inst 1436 SSE4.1: PEXTRD + PINSRD r64 L: 15.74ns= 33.0c T: 0.25ns= 0.53c
  1411. Inst 1437 AVX : VPEXTRQ r64, xmm, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1412. Inst 1438 AVX : VPINSRQ xmm, r64, im8 L: [diff. reg. set] T: 0.72ns= 1.50c
  1413. Inst 1439 AVX : VPEXTRQ + VPINSRQ r64 L: 15.74ns= 33.0c T: 0.31ns= 0.65c
  1414. Inst 1440 SSE4.1: EXTRACTPS r32, xmm, im8 L: [diff. reg. set] T: 1.43ns= 3.00c
  1415. Inst 1441 AVX : VEXTRACTPS r32, xmm, im8 L: [diff. reg. set] T: 1.43ns= 3.00c
  1416. Inst 1442 SSE4.1: EXTRACTPS r64, xmm, im8 L: [diff. reg. set] T: 1.43ns= 3.00c
  1417. Inst 1443 AVX : VEXTRACTPS r64, xmm, im8 L: [diff. reg. set] T: 1.43ns= 3.00c
  1418. Inst 1444 SSE4.1: INSERTPS xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1419. Inst 1445 AVX : VINSERTPS xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1420. Inst 1446 SSE4A : EXTRQ xmm, im8, im8 L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1421. Inst 1447 SSE4A : EXTRQ xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1422. Inst 1448 SSE4A : INSERTQ xmm, xmm, im8, im8 L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1423. Inst 1449 SSE4A : INSERTQ xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1424. Inst 1450 SSE2 : PMAXUB xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1425. Inst 1451 AVX : VPMAXUB xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1426. Inst 1452 SSE4.1: PMAXSB xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1427. Inst 1453 AVX : VPMAXSB xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1428. Inst 1454 SSE4.1: PMAXUW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1429. Inst 1455 AVX : VPMAXUW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1430. Inst 1456 SSE2 : PMAXSW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1431. Inst 1457 AVX : VPMAXSW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1432. Inst 1458 SSE4.1: PMAXUD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1433. Inst 1459 AVX : VPMAXUD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1434. Inst 1460 SSE4.1: PMAXSD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1435. Inst 1461 AVX : VPMAXSD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1436. Inst 1462 SSE2 : PMINUB xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1437. Inst 1463 AVX : VPMINUB xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1438. Inst 1464 SSE4.1: PMINSB xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1439. Inst 1465 AVX : VPMINSB xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1440. Inst 1466 SSE4.1: PMINUW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1441. Inst 1467 AVX : VPMINUW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1442. Inst 1468 SSE2 : PMINSW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1443. Inst 1469 AVX : VPMINSW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1444. Inst 1470 SSE4.1: PMINUD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1445. Inst 1471 AVX : VPMINUD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1446. Inst 1472 SSE4.1: PMINSD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1447. Inst 1473 AVX : VPMINSD xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1448. Inst 1474 SSE2 : PSADBW xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1449. Inst 1475 AVX : VPSADBW xmm, xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1450. Inst 1476 SSSE3 : PSHUFB xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1451. Inst 1477 AVX : VPSHUFB xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1452. Inst 1478 SSE2 : PSHUFLW xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1453. Inst 1479 AVX : VPSHUFLW xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1454. Inst 1480 SSE2 : PSHUFHW xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1455. Inst 1481 AVX : VPSHUFHW xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1456. Inst 1482 SSE2 : PSHUFD xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1457. Inst 1483 AVX : VPSHUFD xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1458. Inst 1484 SSE3 : ADDSUBPS xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1459. Inst 1485 AVX : VADDSUBPS xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1460. Inst 1486 SSE3 : ADDSUBPD xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1461. Inst 1487 AVX : VADDSUBPD xmm, xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1462. Inst 1488 SSE3 : HADDPS xmm, xmm L: 7.87ns= 16.5c T: 1.43ns= 3.00c
  1463. Inst 1489 AVX : VHADDPS xmm, xmm, xmm L: 7.87ns= 16.5c T: 1.43ns= 3.00c
  1464. Inst 1490 SSE3 : HADDPD xmm, xmm L: 7.87ns= 16.5c T: 1.43ns= 3.00c
  1465. Inst 1491 AVX : VHADDPD xmm, xmm, xmm L: 7.87ns= 16.5c T: 1.43ns= 3.00c
  1466. Inst 1492 SSE3 : HSUBPS xmm, xmm L: 7.87ns= 16.5c T: 1.43ns= 3.00c
  1467. Inst 1493 AVX : VHSUBPS xmm, xmm, xmm L: 7.87ns= 16.5c T: 1.43ns= 3.00c
  1468. Inst 1494 SSE3 : HSUBPD xmm, xmm L: 7.87ns= 16.5c T: 1.43ns= 3.00c
  1469. Inst 1495 AVX : VHSUBPD xmm, xmm, xmm L: 7.87ns= 16.5c T: 1.43ns= 3.00c
  1470. Inst 1496 SSE3 : MOVSLDUP xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1471. Inst 1497 AVX : VMOVSLDUP xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1472. Inst 1498 SSE3 : MOVSHDUP xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1473. Inst 1499 AVX : VMOVSHDUP xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1474. Inst 1500 SSE3 : MOVDDUP xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1475. Inst 1501 AVX : VMOVDDUP xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1476. Inst 1502 SSE3 : LDDQU xmm, [m128 + 4] L: [memory dep.] T: 0.72ns= 1.50c
  1477. Inst 1503 AVX : VLDDQU xmm, [m128 + 4] L: [memory dep.] T: 0.72ns= 1.50c
  1478. Inst 1504 SSSE3 : PABSB xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1479. Inst 1505 AVX : VPABSB xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1480. Inst 1506 SSSE3 : PABSW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1481. Inst 1507 AVX : VPABSW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1482. Inst 1508 SSSE3 : PABSD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1483. Inst 1509 AVX : VPABSD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1484. Inst 1510 SSSE3 : PALIGNR xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1485. Inst 1511 AVX : VPALIGNR xmm, xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1486. Inst 1512 SSSE3 : PHADDW xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1487. Inst 1513 AVX : VPHADDW xmm, xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1488. Inst 1514 SSSE3 : PHADDD xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1489. Inst 1515 AVX : VPHADDD xmm, xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1490. Inst 1516 SSSE3 : PHADDSW xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1491. Inst 1517 AVX : VPHADDSW xmm, xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1492. Inst 1518 SSSE3 : PHSUBW xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1493. Inst 1519 AVX : VPHSUBW xmm, xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1494. Inst 1520 SSSE3 : PHSUBD xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1495. Inst 1521 AVX : VPHSUBD xmm, xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1496. Inst 1522 SSSE3 : PHSUBSW xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1497. Inst 1523 AVX : VPHSUBSW xmm, xmm, xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1498. Inst 1524 SSSE3 : PSIGNB xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1499. Inst 1525 AVX : VPSIGNB xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1500. Inst 1526 SSSE3 : PSIGNW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1501. Inst 1527 AVX : VPSIGNW xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1502. Inst 1528 SSSE3 : PSIGND xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1503. Inst 1529 AVX : VPSIGND xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1504. Inst 1530 SSE4.1: BLENDPS xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1505. Inst 1531 AVX : VBLENDPS xmm, xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1506. Inst 1532 SSE4.1: BLENDVPS xmm, xmm, <xmm0> L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1507. Inst 1533 AVX : VBLENDVPS xmm, xmm, xmm, xm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1508. Inst 1534 SSE4.1: BLENDPD xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1509. Inst 1535 AVX : VBLENDPD xmm, xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1510. Inst 1536 SSE4.1: BLENDVPD xmm, xmm, <xmm0> L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1511. Inst 1537 AVX : VBLENDVPD xmm, xmm, xmm, xm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1512. Inst 1538 SSE4.1: PBLENDW xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1513. Inst 1539 AVX : VPBLENDW xmm, xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1514. Inst 1540 SSE4.1: PBLENDVB xmm, xmm, <xmm0> L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1515. Inst 1541 AVX : VPBLENDVB xmm, xmm, xmm, xm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1516. Inst 1542 SSE4.1: DPPS xmm, xmm, imm8 L: 17.89ns= 37.5c T: 1.91ns= 4.00c
  1517. Inst 1543 AVX : VDPPS xmm, xmm, xmm, imm8 L: 17.89ns= 37.5c T: 1.79ns= 3.75c
  1518. Inst 1544 SSE4.1: DPPD xmm, xmm, imm8 L: 10.73ns= 22.5c T: 1.91ns= 4.00c
  1519. Inst 1545 AVX : VDPPD xmm, xmm, xmm, imm8 L: 10.73ns= 22.5c T: 1.91ns= 4.00c
  1520. Inst 1546 SSE4.1: MPSADBW xmm, xmm, imm8 L: 7.16ns= 15.0c T: 2.86ns= 6.00c
  1521. Inst 1547 AVX : VMPSADBW xmm, xmm, imm8 L: 7.16ns= 15.0c T: 3.46ns= 7.25c
  1522. Inst 1548 SSE4.1: PHMINPOSUW xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1523. Inst 1549 AVX : VPHMINPOSUW xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1524. Inst 1550 SSE4.1: PMOVSXBW xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1525. Inst 1551 AVX : VPMOVSXBW xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1526. Inst 1552 SSE4.1: PMOVSXBD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1527. Inst 1553 AVX : VPMOVSXBD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1528. Inst 1554 SSE4.1: PMOVSXBQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1529. Inst 1555 AVX : VPMOVSXBQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1530. Inst 1556 SSE4.1: PMOVSXWD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1531. Inst 1557 AVX : VPMOVSXWD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1532. Inst 1558 SSE4.1: PMOVSXWQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1533. Inst 1559 AVX : VPMOVSXWQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1534. Inst 1560 SSE4.1: PMOVSXDQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1535. Inst 1561 AVX : VPMOVSXDQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1536. Inst 1562 SSE4.1: PMOVZXBW xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1537. Inst 1563 AVX : VPMOVZXBW xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1538. Inst 1564 SSE4.1: PMOVZXBD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1539. Inst 1565 AVX : VPMOVZXBD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1540. Inst 1566 SSE4.1: PMOVZXBQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1541. Inst 1567 AVX : VPMOVZXBQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1542. Inst 1568 SSE4.1: PMOVZXWD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1543. Inst 1569 AVX : VPMOVZXWD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1544. Inst 1570 SSE4.1: PMOVZXWQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1545. Inst 1571 AVX : VPMOVZXWQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1546. Inst 1572 SSE4.1: PMOVZXDQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1547. Inst 1573 AVX : VPMOVZXDQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1548. Inst 1574 SSE4.1: PTEST xmm, xmm L: [no true dep.] T: 0.72ns= 1.50c
  1549. Inst 1575 AVX : VPTEST xmm, xmm L: [no true dep.] T: 0.72ns= 1.50c
  1550. Inst 1576 AVX : VPTESTPS xmm, xmm L: [no true dep.] T: 0.72ns= 1.50c
  1551. Inst 1577 AVX : VPTESTPD xmm, xmm L: [no true dep.] T: 0.72ns= 1.50c
  1552. Inst 1578 SSE4.1: ROUNDSS xmm, xmm, imm8 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1553. Inst 1579 AVX : VROUNDSS xmm, xmm, xmm, im8 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1554. Inst 1580 SSE4.1: ROUNDPS xmm, xmm, imm8 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1555. Inst 1581 AVX : VROUNDPS xmm, xmm, imm8 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1556. Inst 1582 SSE4.1: ROUNDSD xmm, xmm, imm8 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1557. Inst 1583 AVX : VROUNDSD xmm, xmm, xmm, im8 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1558. Inst 1584 SSE4.1: ROUNDPD xmm, xmm, imm8 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1559. Inst 1585 AVX : VROUNDPD xmm, xmm, imm8 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1560. Inst 1586 AVX : VBROADCASTSS xmm, m32 L: [memory dep.] T: 0.72ns= 1.50c
  1561. Inst 1587 SSE4.2: PCMPESTRI xmm, xmm, imm8 L: 7.87ns= 16.5c T: 7.87ns= 16.50c
  1562. Inst 1588 AVX : VPCMPESTRI xmm, xmm, imm8 L: 7.16ns= 15.0c T: 7.16ns= 15.00c
  1563. Inst 1589 SSE4.2: PCMPESTRM xmm, xmm, imm8 L: 7.16ns= 15.0c T: 7.16ns= 15.00c
  1564. Inst 1590 AVX : VPCMPESTRM xmm, xmm, imm8 L: 7.16ns= 15.0c T: 7.16ns= 15.00c
  1565. Inst 1591 SSE4.2: PCMPISTRI xmm, xmm, imm8 L: 3.58ns= 7.5c T: 3.58ns= 7.50c
  1566. Inst 1592 AVX : VPCMPISTRI xmm, xmm, imm8 L: 2.86ns= 6.0c T: 2.86ns= 6.00c
  1567. Inst 1593 SSE4.2: PCMPISTRM xmm, xmm, imm8 L: 2.86ns= 6.0c T: 2.86ns= 6.00c
  1568. Inst 1594 AVX : VPCMPISTRM xmm, xmm, imm8 L: 2.86ns= 6.0c T: 2.86ns= 6.00c
  1569. Inst 1595 CLMUL : PCLMULQDQ xmm, xmm, imm8 L: 4.29ns= 9.0c T: 2.86ns= 6.00c
  1570. Inst 1596 AVX : VPCLMULQDQ xmm,xmm,xmm,im8 L: 4.29ns= 9.0c T: 2.86ns= 6.00c
  1571. Inst 1597 AESNI : AESENC xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1572. Inst 1598 AVX : VAESENC xmm, xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1573. Inst 1599 AESNI : AESENCLAST xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1574. Inst 1600 AVX : VAESENCLAST xmm, xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1575. Inst 1601 AESNI : AESDEC xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1576. Inst 1602 AVX : VAESDEC xmm, xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1577. Inst 1603 AESNI : AESDECLAST xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1578. Inst 1604 AVX : VAESDECLAST xmm, xmm, xmm L: 4.29ns= 9.0c T: 0.72ns= 1.50c
  1579. Inst 1605 AESNI : AESIMC xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1580. Inst 1606 AVX : VAESIMC xmm, xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1581. Inst 1607 AESNI : AESKEYGEN xmm, xmm, imm8 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1582. Inst 1608 AVX : VAESKEYGEN xmm, xmm, imm8 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1583. Inst 1609 FMA4 : VFMADDSS xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1584. Inst 1610 FMA3 : VFMADD132SS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1585. Inst 1611 FMA3 : VFMADD213SS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1586. Inst 1612 FMA3 : VFMADD231SS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1587. Inst 1613 FMA4 : VFMADDPS xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1588. Inst 1614 FMA3 : VFMADD132PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1589. Inst 1615 FMA3 : VFMADD213PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1590. Inst 1616 FMA3 : VFMADD231PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1591. Inst 1617 FMA4 : VFMSUBSS xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1592. Inst 1618 FMA3 : VFMSUB132SS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1593. Inst 1619 FMA3 : VFMSUB213SS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1594. Inst 1620 FMA3 : VFMSUB231SS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1595. Inst 1621 FMA4 : VFMSUBPS xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1596. Inst 1622 FMA3 : VFMSUB132PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1597. Inst 1623 FMA3 : VFMSUB213PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1598. Inst 1624 FMA3 : VFMSUB231PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1599. Inst 1625 FMA4 : VFNMADDSS xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1600. Inst 1626 FMA3 : VFNMADD132SS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1601. Inst 1627 FMA3 : VFNMADD213SS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1602. Inst 1628 FMA3 : VFNMADD231SS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1603. Inst 1629 FMA4 : VFNMADDPS xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1604. Inst 1630 FMA3 : VFNMADD132PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1605. Inst 1631 FMA3 : VFNMADD213PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1606. Inst 1632 FMA3 : VFNMADD231PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1607. Inst 1633 FMA4 : VFNMSUBSS xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1608. Inst 1634 FMA3 : VFNMSUB132SS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1609. Inst 1635 FMA3 : VFNMSUB213SS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1610. Inst 1636 FMA3 : VFNMSUB231SS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1611. Inst 1637 FMA4 : VFNMSUBPS xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1612. Inst 1638 FMA3 : VFNMSUB132PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1613. Inst 1639 FMA3 : VFNMSUB213PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1614. Inst 1640 FMA3 : VFNMSUB231PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1615. Inst 1641 FMA4 : VFMADDSUBPS xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1616. Inst 1642 FMA3 : VFMADDSUB132PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1617. Inst 1643 FMA3 : VFMADDSUB213PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1618. Inst 1644 FMA3 : VFMADDSUB231PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1619. Inst 1645 FMA4 : VFMSUBADDPS xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1620. Inst 1646 FMA3 : VFMSUBADD132PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1621. Inst 1647 FMA3 : VFMSUBADD213PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1622. Inst 1648 FMA3 : VFMSUBADD231PS xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1623. Inst 1649 FMA4 : VFMADDSD xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1624. Inst 1650 FMA3 : VFMADD132SD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1625. Inst 1651 FMA3 : VFMADD213SD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1626. Inst 1652 FMA3 : VFMADD231SD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1627. Inst 1653 FMA4 : VFMADDPD xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1628. Inst 1654 FMA3 : VFMADD132PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1629. Inst 1655 FMA3 : VFMADD213PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1630. Inst 1656 FMA3 : VFMADD231PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1631. Inst 1657 FMA4 : VFMSUBSD xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1632. Inst 1658 FMA3 : VFMSUB132SD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1633. Inst 1659 FMA3 : VFMSUB213SD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1634. Inst 1660 FMA3 : VFMSUB231SD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1635. Inst 1661 FMA4 : VFMSUBPD xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1636. Inst 1662 FMA3 : VFMSUB132PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1637. Inst 1663 FMA3 : VFMSUB213PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1638. Inst 1664 FMA3 : VFMSUB231PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1639. Inst 1665 FMA4 : VFNMADDSD xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1640. Inst 1666 FMA3 : VFNMADD132SD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1641. Inst 1667 FMA3 : VFNMADD213SD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1642. Inst 1668 FMA3 : VFNMADD231SD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1643. Inst 1669 FMA4 : VFNMADDPD xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1644. Inst 1670 FMA3 : VFNMADD132PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1645. Inst 1671 FMA3 : VFNMADD213PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1646. Inst 1672 FMA3 : VFNMADD231PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1647. Inst 1673 FMA4 : VFNMSUBSD xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1648. Inst 1674 FMA3 : VFNMSUB132SD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1649. Inst 1675 FMA3 : VFNMSUB213SD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1650. Inst 1676 FMA3 : VFNMSUB231SD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1651. Inst 1677 FMA4 : VFNMSUBPD xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1652. Inst 1678 FMA3 : VFNMSUB132PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1653. Inst 1679 FMA3 : VFNMSUB213PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1654. Inst 1680 FMA3 : VFNMSUB231PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1655. Inst 1681 FMA4 : VFMADDSUBPD xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1656. Inst 1682 FMA3 : VFMADDSUB132PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1657. Inst 1683 FMA3 : VFMADDSUB213PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1658. Inst 1684 FMA3 : VFMADDSUB231PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1659. Inst 1685 FMA4 : VFMSUBADDPD xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1660. Inst 1686 FMA3 : VFMSUBADD132PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1661. Inst 1687 FMA3 : VFMSUBADD213PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1662. Inst 1688 FMA3 : VFMSUBADD231PD xmm,xmm,xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1663. Inst 1689 XOP : VFRCZSS xmm, xmm L: 7.16ns= 15.0c T: 0.48ns= 1.00c
  1664. Inst 1690 XOP : VFRCZPS xmm, xmm L: 7.16ns= 15.0c T: 0.48ns= 1.00c
  1665. Inst 1691 XOP : VFRCZSD xmm, xmm L: 7.16ns= 15.0c T: 0.48ns= 1.00c
  1666. Inst 1692 XOP : VFRCZPD xmm, xmm L: 7.16ns= 15.0c T: 0.48ns= 1.00c
  1667. Inst 1693 XOP : VPCMOV xmm, xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1668. Inst 1694 XOP : VPCOMB xmm, xmm, xmm, imm8 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1669. Inst 1695 XOP : VPCOMB xm1, xm1, xm2, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1670. Inst 1696 XOP : VPCOMW xmm, xmm, xmm, imm8 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1671. Inst 1697 XOP : VPCOMW xm1, xm1, xm2, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1672. Inst 1698 XOP : VPCOMD xmm, xmm, xmm, imm8 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1673. Inst 1699 XOP : VPCOMD xm1, xm1, xm2, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1674. Inst 1700 XOP : VPCOMQ xmm, xmm, xmm, imm8 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1675. Inst 1701 XOP : VPCOMQ xm1, xm1, xm2, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1676. Inst 1702 XOP : VPCOMUB xmm, xmm, xmm, imm8 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1677. Inst 1703 XOP : VPCOMUB xm1, xm1, xm2, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1678. Inst 1704 XOP : VPCOMUW xmm, xmm, xmm, imm8 L: 0.36ns= 0.7c T: 0.36ns= 0.75c
  1679. Inst 1705 XOP : VPCOMUW xm1, xm1, xm2, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1680. Inst 1706 XOP : VPCOMUD xmm, xmm, xmm, imm8 L: 0.35ns= 0.7c T: 0.35ns= 0.74c
  1681. Inst 1707 XOP : VPCOMUD xm1, xm1, xm2, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1682. Inst 1708 XOP : VPCOMUQ xmm, xmm, xmm, imm8 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  1683. Inst 1709 XOP : VPCOMUQ xm1, xm1, xm2, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1684. Inst 1710 XOP : VPERMIL2PS xm,xm,xm,xm,im L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1685. Inst 1711 XOP : VPERMIL2PD xm,xm,xm,xm,im L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1686. Inst 1712 XOP : VPHADDBW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1687. Inst 1713 XOP : VPHADDBD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1688. Inst 1714 XOP : VPHADDBQ xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1689. Inst 1715 XOP : VPHADDWD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1690. Inst 1716 XOP : VPHADDWQ xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1691. Inst 1717 XOP : VPHADDDQ xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1692. Inst 1718 XOP : VPHADDUBW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1693. Inst 1719 XOP : VPHADDUBD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1694. Inst 1720 XOP : VPHADDUBQ xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1695. Inst 1721 XOP : VPHADDUWD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1696. Inst 1722 XOP : VPHADDUWQ xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1697. Inst 1723 XOP : VPHADDUDQ xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1698. Inst 1724 XOP : VPHSUBBW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1699. Inst 1725 XOP : VPHSUBWD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1700. Inst 1726 XOP : VPHSUBDQ xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1701. Inst 1727 XOP : VPMACSWW xmm,xmm,xmm,xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1702. Inst 1728 XOP : VPMACSWW xm1,xm2,xm2,xm1 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1703. Inst 1729 XOP : VPMACSWD xmm,xmm,xmm,xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1704. Inst 1730 XOP : VPMACSWD xm1,xm2,xm2,xm1 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1705. Inst 1731 XOP : VPMACSDD xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1706. Inst 1732 XOP : VPMACSDD xm1,xm2,xm2,xm1 L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1707. Inst 1733 XOP : VPMACSDQL xmm,xmm,xmm,xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1708. Inst 1734 XOP : VPMACSDQL xm1,xm2,xm2,xm1 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1709. Inst 1735 XOP : VPMACSDQH xmm,xmm,xmm,xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1710. Inst 1736 XOP : VPMACSDQH xm1,xm2,xm2,xm1 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1711. Inst 1737 XOP : VPMACSSWW xmm,xmm,xmm,xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1712. Inst 1738 XOP : VPMACSSWW xm1,xm2,xm2,xm1 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1713. Inst 1739 XOP : VPMACSSWD xmm,xmm,xmm,xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1714. Inst 1740 XOP : VPMACSSWD xm1,xm2,xm2,xm1 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1715. Inst 1741 XOP : VPMACSSDD xmm,xmm,xmm,xmm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1716. Inst 1742 XOP : VPMACSSDD xm1,xm2,xm2,xm1 L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1717. Inst 1743 XOP : VPMACSSDQL xmm,xmm,xmm,xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1718. Inst 1744 XOP : VPMACSSDQL xm1,xm2,xm2,xm1 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1719. Inst 1745 XOP : VPMACSSDQH xmm,xmm,xmm,xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1720. Inst 1746 XOP : VPMACSSDQH xm1,xm2,xm2,xm1 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1721. Inst 1747 XOP : VPMADCSWD xmm,xmm,xmm,xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1722. Inst 1748 XOP : VPMADCSWD xm1,xm2,xm2,xm1 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1723. Inst 1749 XOP : VPMADCSSWD xmm,xmm,xmm,xmm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1724. Inst 1750 XOP : VPMADCSSWD xm1,xm2,xm2,xm1 L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1725. Inst 1751 XOP : VPPERM xmm, xmm, xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1726. Inst 1752 XOP : VPROTB xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1727. Inst 1753 XOP : VPROTB xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1728. Inst 1754 XOP : VPROTW xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1729. Inst 1755 XOP : VPROTW xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1730. Inst 1756 XOP : VPROTD xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1731. Inst 1757 XOP : VPROTD xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1732. Inst 1758 XOP : VPROTQ xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1733. Inst 1759 XOP : VPROTQ xmm, xmm, imm8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1734. Inst 1760 XOP : VPSHAB xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1735. Inst 1761 XOP : VPSHAW xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1736. Inst 1762 XOP : VPSHAD xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1737. Inst 1763 XOP : VPSHAQ xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1738. Inst 1764 XOP : VPSHLB xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1739. Inst 1765 XOP : VPSHLW xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1740. Inst 1766 XOP : VPSHLD xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1741. Inst 1767 XOP : VPSHLQ xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  1742. Inst 1768 F16C : VCVTPS2PH xmm, xmm, imm8 L: 5.73ns= 12.0c T: 1.43ns= 3.00c
  1743. Inst 1769 F16C : VCVTPH2PS xmm, xmm L: 5.73ns= 12.0c T: 1.43ns= 3.00c
  1744. Inst 1770 AVX : VMOVAPS ymm, ymm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1745. Inst 1771 AVX : VMOVAPS ymm, [m256] L: [memory dep.] T: 0.72ns= 1.50c
  1746. Inst 1772 AVX : VMOVAPS [m256], ymm L: [memory dep.] T: 1.43ns= 3.00c
  1747. Inst 1773 AVX : VMOVAPS LS pair L: 7.87ns= 16.5c T: 1.15ns= 2.42c
  1748. Inst 1774 AVX : VMOVUPS ymm, ymm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1749. Inst 1775 AVX : VMOVUPS ymm, [m256] L: [memory dep.] T: 0.72ns= 1.50c
  1750. Inst 1776 AVX : VMOVUPS [m256], ymm L: [memory dep.] T: 1.43ns= 3.00c
  1751. Inst 1777 AVX : VMOVUPS aligned LS pair L: 7.87ns= 16.5c T: 3.14ns= 6.58c
  1752. Inst 1778 AVX : VMOVUPS ymm, [m256 + 4] L: [memory dep.] T: 1.43ns= 3.00c
  1753. Inst 1779 AVX : VMOVUPS [m256 + 4], ymm L: [memory dep.] T: 2.86ns= 6.00c
  1754. Inst 1780 AVX : VMOVUPS unaligned LS pair L: 8.95ns= 18.8c T: 2.90ns= 6.08c
  1755. Inst 1781 AVX : VMOVSLDUP ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1756. Inst 1782 AVX : VMOVSHDUP ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1757. Inst 1783 AVX : VMOVNTPS [m256], ymm L: [memory dep.] T: 4.25ns= 4.25c
  1758. Inst 1784 AVX : VMOVMSKPS r32, ymm L: [diff. reg. set] T: 0.72ns= 1.50c
  1759. Inst 1785 AVX : VMASKMOVPS ymm,ymm,[m256+4] L: [memory dep.] T: 1.43ns= 3.00c
  1760. Inst 1786 AVX : VMASKMOVPS [m256+4],ymm,ymm L: [memory dep.] T: 11.45ns= 24.00c
  1761. Inst 1787 AVX : VMASKMOVPS unaligned LSpair L: 28.63ns= 60.0c T: 11.45ns= 24.00c
  1762. Inst 1788 AVX : VUNPCKLPS ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1763. Inst 1789 AVX : VUNPCKHPS ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1764. Inst 1790 AVX : VSHUFPS ymm, ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1765. Inst 1791 AVX : VPERMILPS ymm, ymm, ymm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  1766. Inst 1792 AVX : VPERMILPS ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1767. Inst 1793 AVX : VCMPPS ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1768. Inst 1794 AVX : VADDSUBPS ymm, ymm, ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1769. Inst 1795 AVX : VHSUBPS ymm, ymm, ymm L: 7.87ns= 16.5c T: 2.86ns= 6.00c
  1770. Inst 1796 AVX : VHADDPS ymm, ymm, ymm L: 7.87ns= 16.5c T: 2.86ns= 6.00c
  1771. Inst 1797 AVX : VSUBPS ymm, ymm, ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1772. Inst 1798 AVX : VADDPS ymm, ymm, ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1773. Inst 1799 AVX : VMULPS ymm, ymm, ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1774. Inst 1800 AVX : VMULPS+VADDPS ymm, ymm, ymm L: 7.16ns= 15.0c T: 1.43ns= 3.00c
  1775. Inst 1801 AVX : VMULPS ymm1.. VADDPS ymm2.. L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1776. Inst 1802 AVX : VMAXPS ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1777. Inst 1803 AVX : VMINPS ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1778. Inst 1804 AVX : VANDNPS ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1779. Inst 1805 AVX : VANDNPS ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1780. Inst 1806 AVX : VANDPS ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1781. Inst 1807 AVX : VANDPS ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1782. Inst 1808 AVX : VORPS ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1783. Inst 1809 AVX : VORPS ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1784. Inst 1810 AVX : VXORPS ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1785. Inst 1811 AVX : VXORPS ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1786. Inst 1812 AVX : VDIVPS ymm, ymm, ymm L: 10.02ns= 21.0c T: 6.44ns= 13.50c
  1787. Inst 1813 AVX : VDIVPS (0.0f/x) L: 10.02ns= 21.0c T: 6.44ns= 13.50c
  1788. Inst 1814 AVX : VDIVPS (x/1.0f) L: 10.02ns= 21.0c T: 6.44ns= 13.50c
  1789. Inst 1815 AVX : VDIVPS (x/2.0f) L: 9.38ns= 19.7c T: 6.44ns= 13.50c
  1790. Inst 1816 AVX : VDIVPS (x/0.5f) L: 9.38ns= 19.7c T: 6.44ns= 13.50c
  1791. Inst 1817 AVX : VSQRTPS ymm, ymm L: 10.73ns= 22.5c T: 7.16ns= 15.00c
  1792. Inst 1818 AVX : VSQRTPS (0.0f) L: 10.73ns= 22.5c T: 7.16ns= 15.00c
  1793. Inst 1819 AVX : VSQRTPS (1.0f) L: 10.73ns= 22.5c T: 7.16ns= 15.00c
  1794. Inst 1820 AVX : VRCPPS ymm, ymm, ymm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1795. Inst 1821 AVX : VRSQRTPS ymm, ymm, ymm L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1796. Inst 1822 AVX : VBLENDPS ymm, ymm, ymm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1797. Inst 1823 AVX : VBLENDVPS ymm, ymm, ymm, ym L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1798. Inst 1824 AVX : VDPPS ymm, ymm, ymm, imm8 L: 19.32ns= 40.5c T: 3.54ns= 7.42c
  1799. Inst 1825 AVX : VPTESTPS ymm, ymm L: [no true dep.] T: 1.43ns= 3.00c
  1800. Inst 1826 AVX : VROUNDPS ymm, ymm, imm8 L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  1801. Inst 1827 AVX : VMOVAPD ymm, ymm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1802. Inst 1828 AVX : VMOVAPD ymm, [m256] L: [memory dep.] T: 0.72ns= 1.50c
  1803. Inst 1829 AVX : VMOVAPD [m256], ymm L: [memory dep.] T: 1.43ns= 3.00c
  1804. Inst 1830 AVX : VMOVAPD LS pair L: 7.87ns= 16.5c T: 1.11ns= 2.33c
  1805. Inst 1831 AVX : VMOVUPD ymm, ymm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1806. Inst 1832 AVX : VMOVUPD ymm, [m256] L: [memory dep.] T: 0.72ns= 1.50c
  1807. Inst 1833 AVX : VMOVUPD [m256], ymm L: [memory dep.] T: 1.43ns= 3.00c
  1808. Inst 1834 AVX : VMOVUPD aligned LS pair L: 7.87ns= 16.5c T: 0.72ns= 1.50c
  1809. Inst 1835 AVX : VMOVUPD ymm, [m256 + 4] L: [memory dep.] T: 1.43ns= 3.00c
  1810. Inst 1836 AVX : VMOVUPD [m256 + 4], ymm L: [memory dep.] T: 2.86ns= 6.00c
  1811. Inst 1837 AVX : VMOVUPD unaligned LS pair L: 8.95ns= 18.8c T: 2.94ns= 6.17c
  1812. Inst 1838 AVX : VMOVDDUP ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1813. Inst 1839 AVX : VMOVNTPD [m256], ymm L: [memory dep.] T: 4.25ns= 4.25c
  1814. Inst 1840 AVX : VMOVMSKPD r32, ymm L: [diff. reg. set] T: 0.72ns= 1.50c
  1815. Inst 1841 AVX : VMASKMOVPD ymm,ymm,[m256+4] L: [memory dep.] T: 1.43ns= 3.00c
  1816. Inst 1842 AVX : VMASKMOVPD [m256+4],ymm,ymm L: [memory dep.] T: 11.45ns= 24.00c
  1817. Inst 1843 AVX : VMASKMOVPD unaligned LSpair L: 28.63ns= 60.0c T: 11.45ns= 24.00c
  1818. Inst 1844 AVX : VUNPCKLPD ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1819. Inst 1845 AVX : VUNPCKHPD ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1820. Inst 1846 AVX : VSHUFPD ymm, ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1821. Inst 1847 AVX : VPERMILPD ymm, ymm, ymm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  1822. Inst 1848 AVX : VPERMILPD ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1823. Inst 1849 AVX : VCMPPD ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1824. Inst 1850 AVX : VADDSUBPD ymm, ymm, ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1825. Inst 1851 AVX : VHSUBPD ymm, ymm, ymm L: 7.87ns= 16.5c T: 2.86ns= 6.00c
  1826. Inst 1852 AVX : VHADDPD ymm, ymm, ymm L: 7.87ns= 16.5c T: 2.86ns= 6.00c
  1827. Inst 1853 AVX : VSUBPD ymm, ymm, ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1828. Inst 1854 AVX : VADDPD ymm, ymm, ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1829. Inst 1855 AVX : VMULPD ymm, ymm, ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1830. Inst 1856 AVX : VMULPD+VADDPD ymm, ymm, ymm L: 7.16ns= 15.0c T: 1.43ns= 3.00c
  1831. Inst 1857 AVX : VMULPD ymm1.. VADDPD ymm2.. L: 3.58ns= 7.5c T: 1.43ns= 3.00c
  1832. Inst 1858 AVX : VMAXPD ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1833. Inst 1859 AVX : VMINPD ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1834. Inst 1860 AVX : VANDNPD ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1835. Inst 1861 AVX : VANDNPD ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1836. Inst 1862 AVX : VANDPD ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1837. Inst 1863 AVX : VANDPD ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1838. Inst 1864 AVX : VORPD ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1839. Inst 1865 AVX : VORPD ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1840. Inst 1866 AVX : VXORPD ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1841. Inst 1867 AVX : VXORPD ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1842. Inst 1868 AVX : VDIVPD ymm, ymm, ymm L: 17.18ns= 36.0c T: 13.60ns= 28.50c
  1843. Inst 1869 AVX : VDIVPD (0.0/x) L: 6.44ns= 13.5c T: 6.44ns= 13.50c
  1844. Inst 1870 AVX : VDIVPD (x/1.0) L: 6.44ns= 13.5c T: 6.44ns= 13.50c
  1845. Inst 1871 AVX : VDIVPD (x/2.0) L: 6.44ns= 13.5c T: 6.44ns= 13.50c
  1846. Inst 1872 AVX : VDIVPD (x/0.5) L: 6.44ns= 13.5c T: 6.44ns= 13.50c
  1847. Inst 1873 AVX : VSQRTPD ymm, ymm L: 17.89ns= 37.5c T: 14.31ns= 30.00c
  1848. Inst 1874 AVX : VSQRTPD (0.0) L: 6.44ns= 13.5c T: 6.44ns= 13.50c
  1849. Inst 1875 AVX : VSQRTPD (1.0) L: 6.44ns= 13.5c T: 6.44ns= 13.50c
  1850. Inst 1876 AVX : VBLENDPD ymm, ymm, ymm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1851. Inst 1877 AVX : VBLENDVPD ymm, ymm, ymm, ym L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1852. Inst 1878 AVX : VCVTDQ2PD ymm, xmm L: 4.29ns= 9.0c T: 1.43ns= 3.00c
  1853. Inst 1879 AVX : VCVTPD2DQ xmm, ymm L: 5.72ns= 12.0c T: 1.43ns= 3.00c
  1854. Inst 1880 AVX : VCVTPD2DQ + VCVTDQ2PD L: 10.73ns= 22.5c T: 2.66ns= 5.58c
  1855. Inst 1881 AVX : VCVTTPD2DQ xmm, ymm L: 5.73ns= 12.0c T: 1.43ns= 3.00c
  1856. Inst 1882 AVX : VCVTTPD2DQ + VCVTDQ2PD L: 10.73ns= 22.5c T: 2.70ns= 5.67c
  1857. Inst 1883 AVX : VCVTDQ2PS ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  1858. Inst 1884 AVX : VCVTPS2DQ ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  1859. Inst 1885 AVX : VCVTPS2DQ + VCVTDQ2PS L: 5.73ns= 12.0c T: 2.86ns= 6.00c
  1860. Inst 1886 AVX : VCVTTPS2DQ ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  1861. Inst 1887 AVX : VCVTTPS2DQ + VCVTDQ2PS L: 5.73ns= 12.0c T: 2.86ns= 6.00c
  1862. Inst 1888 AVX : VCVTPS2PD ymm, xmm L: 4.29ns= 9.0c T: 1.43ns= 3.00c
  1863. Inst 1889 AVX : VCVTPD2PS xmm, ymm L: 5.73ns= 12.0c T: 1.43ns= 3.00c
  1864. Inst 1890 AVX : VCVTPD2PS + VCVTPS2PD L: 10.73ns= 22.5c T: 2.70ns= 5.67c
  1865. Inst 1891 AVX : VPTESTPD ymm, ymm L: [no true dep.] T: 1.43ns= 3.00c
  1866. Inst 1892 AVX : VROUNDPD ymm, ymm, imm8 L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  1867. Inst 1893 AVX : VBROADCASTSS ymm, m32 L: [memory dep.] T: 0.36ns= 0.75c
  1868. Inst 1894 AVX : VBROADCASTSD ymm, m64 L: [memory dep.] T: 0.36ns= 0.75c
  1869. Inst 1895 AVX : VBROADCASTF128 ymm, m128 L: [memory dep.] T: 0.37ns= 0.77c
  1870. Inst 1896 AVX : VEXTRACTF128 xmm, ymm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1871. Inst 1897 AVX : VINSERTF128 ym, ym, xm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  1872. Inst 1898 AVX : VPERM2F128 ym, ym, ym, im8 L: 3.58ns= 7.5c T: 2.27ns= 4.75c
  1873. Inst 1899 AVX : VMOVDQA ymm, ymm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1874. Inst 1900 AVX : VMOVDQA ymm, [m256] L: [memory dep.] T: 0.72ns= 1.50c
  1875. Inst 1901 AVX : VMOVDQA [m256], ymm L: [memory dep.] T: 1.43ns= 3.00c
  1876. Inst 1902 AVX : VMOVDQA LS pair L: 7.87ns= 16.5c T: 1.67ns= 3.50c
  1877. Inst 1903 AVX : VMOVDQU ymm, ymm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1878. Inst 1904 AVX : VMOVDQU ymm, [m256] L: [memory dep.] T: 0.72ns= 1.50c
  1879. Inst 1905 AVX : VMOVDQU [m256], ymm L: [memory dep.] T: 1.43ns= 3.00c
  1880. Inst 1906 AVX : VMOVDQU aligned LS pair L: 7.87ns= 16.5c T: 1.79ns= 3.75c
  1881. Inst 1907 AVX : VMOVDQU ymm, [m256 + 4] L: [memory dep.] T: 1.43ns= 3.00c
  1882. Inst 1908 AVX : VMOVDQU [m256 + 4], ymm L: [memory dep.] T: 2.86ns= 6.00c
  1883. Inst 1909 AVX : VMOVDQU unaligned LS pair L: 8.95ns= 18.8c T: 2.94ns= 6.17c
  1884. Inst 1910 AVX : VMOVNTDQ [m256], ymm L: [memory dep.] T: 4.25ns= 4.25c
  1885. Inst 1911 AVX : VLDDQU ymm, [m256 + 4] L: [memory dep.] T: 1.43ns= 3.00c
  1886. Inst 1912 AVX : VZEROUPPER L: [no true dep.] T: 4.29ns= 9.00c
  1887. Inst 1913 AVX : VZEROALL L: [no true dep.] T: 0.00ns= 0.00c
  1888. Inst 1914 FMA4 : VFMADDPS ymm,ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1889. Inst 1915 FMA3 : VFMADD132PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1890. Inst 1916 FMA3 : VFMADD213PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1891. Inst 1917 FMA3 : VFMADD231PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1892. Inst 1918 FMA4 : VFMSUBPS ymm,ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1893. Inst 1919 FMA3 : VFMSUB132PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1894. Inst 1920 FMA3 : VFMSUB213PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1895. Inst 1921 FMA3 : VFMSUB231PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1896. Inst 1922 FMA4 : VFNMADDPS ymm,ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1897. Inst 1923 FMA3 : VFNMADD132PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1898. Inst 1924 FMA3 : VFNMADD213PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1899. Inst 1925 FMA3 : VFNMADD231PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1900. Inst 1926 FMA4 : VFNMSUBPS ymm,ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1901. Inst 1927 FMA3 : VFNMSUB132PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1902. Inst 1928 FMA3 : VFNMSUB213PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1903. Inst 1929 FMA3 : VFNMSUB231PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1904. Inst 1930 FMA4 : VFMADDSUBPS ymm,ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1905. Inst 1931 FMA3 : VFMADDSUB132PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1906. Inst 1932 FMA3 : VFMADDSUB213PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1907. Inst 1933 FMA3 : VFMADDSUB231PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1908. Inst 1934 FMA4 : VFMSUBADDPS ymm,ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1909. Inst 1935 FMA3 : VFMSUBADD132PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1910. Inst 1936 FMA3 : VFMSUBADD213PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1911. Inst 1937 FMA3 : VFMSUBADD231PS ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1912. Inst 1938 FMA4 : VFMADDPD ymm,ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1913. Inst 1939 FMA3 : VFMADD132PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1914. Inst 1940 FMA3 : VFMADD213PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1915. Inst 1941 FMA3 : VFMADD231PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1916. Inst 1942 FMA4 : VFMSUBPD ymm,ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1917. Inst 1943 FMA3 : VFMSUB132PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1918. Inst 1944 FMA3 : VFMSUB213PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1919. Inst 1945 FMA3 : VFMSUB231PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1920. Inst 1946 FMA4 : VFNMADDPD ymm,ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1921. Inst 1947 FMA3 : VFNMADD132PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1922. Inst 1948 FMA3 : VFNMADD213PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1923. Inst 1949 FMA3 : VFNMADD231PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1924. Inst 1950 FMA4 : VFNMSUBPD ymm,ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1925. Inst 1951 FMA3 : VFNMSUB132PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1926. Inst 1952 FMA3 : VFNMSUB213PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1927. Inst 1953 FMA3 : VFNMSUB231PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1928. Inst 1954 FMA4 : VFMADDSUBPD ymm,ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1929. Inst 1955 FMA3 : VFMADDSUB132PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1930. Inst 1956 FMA3 : VFMADDSUB213PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1931. Inst 1957 FMA3 : VFMADDSUB231PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1932. Inst 1958 FMA4 : VFMSUBADDPD ymm,ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1933. Inst 1959 FMA3 : VFMSUBADD132PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1934. Inst 1960 FMA3 : VFMSUBADD213PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1935. Inst 1961 FMA3 : VFMSUBADD231PD ymm,ymm,ymm L: 3.58ns= 7.5c T: 0.72ns= 1.50c
  1936. Inst 1962 XOP : VFRCZSD ymm, ymm L: 7.16ns= 15.0c T: 1.43ns= 3.00c
  1937. Inst 1963 XOP : VFRCZPD ymm, ymm L: 7.16ns= 15.0c T: 1.43ns= 3.00c
  1938. Inst 1964 XOP : VPCMOV ymm, ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1939. Inst 1965 XOP : VPERMIL2PS ym,ym,ym,ym,im L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  1940. Inst 1966 XOP : VPERMIL2PD ym,ym,ym,ym,im L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  1941. Inst 1967 F16C : VCVTPS2PH + VCVTPH2PS L: 5.73ns= 12.0c T: 2.86ns= 6.00c
  1942. Inst 1968 F16C : VCVTPS2PH xmm, ymm, imm8 L: 5.73ns= 12.0c T: 1.43ns= 3.00c
  1943. Inst 1969 F16C : VCVTPH2PS ymm, xmm L: 5.73ns= 12.0c T: 1.43ns= 3.00c
  1944. Inst 1970 F16C : VCVTPS2PH + VCVTPH2PS L: 5.73ns= 12.0c T: 2.86ns= 6.00c
  1945. Inst 1971 RDRAND: RDRAND r16 L: [no true dep.] T:2916.70ns=6113.58c
  1946. Inst 1972 RDRAND: RDRAND r32 L: [no true dep.] T:3020.47ns=6331.08c
  1947. Inst 1973 RDRAND: RDRAND r64 L: [no true dep.] T:6817.63ns=14290.17c
  1948. Inst 1974 X86 : MOV+ADD r8, r8 L: 1.43ns= 3.0c T: 0.23ns= 0.48c
  1949. Inst 1975 X86 : MOV+ADD r16, r16 L: 1.43ns= 3.0c T: 0.36ns= 0.76c
  1950. Inst 1976 X86 : MOV+ADD r32, r32 L: 1.07ns= 2.3c T: 0.36ns= 0.75c
  1951. Inst 1977 AMD64 : MOV+ADD r64, r64 L: 1.07ns= 2.3c T: 0.36ns= 0.75c
  1952. Inst 1978 MMX : MOVQ+PADDB mm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1953. Inst 1979 MMX : MOVQ+PADDW mm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1954. Inst 1980 MMX : MOVQ+PADDD mm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1955. Inst 1981 SSE2 : MOVQ+PADDQ mm, mm L: 2.86ns= 6.0c T: 0.72ns= 1.50c
  1956. Inst 1983 SSE : MOVSS+ADDSS xmm, xmm L: 5.73ns= 12.0c T: 0.48ns= 1.00c
  1957. Inst 1984 AVX : VMOVSS+VADDSS xm, xm, xm L: 5.73ns= 12.0c T: 0.48ns= 1.00c
  1958. Inst 1985 SSE : MOVAPS+ADDPS xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1959. Inst 1986 AVX : VMOVAPS+VADDPS xm, xm, xm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1960. Inst 1987 SSE2 : MOVSD+ADDSD xmm, xmm L: 5.73ns= 12.0c T: 0.48ns= 1.00c
  1961. Inst 1988 AVX : VMOVSD+VADDSD xm, xm, xm L: 5.73ns= 12.0c T: 0.48ns= 1.00c
  1962. Inst 1989 SSE2 : MOVAPD+ADDPD xmm, xmm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1963. Inst 1990 AVX : VMOVAPD+VADDPD xm, xm, xm L: 3.58ns= 7.5c T: 0.44ns= 0.92c
  1964. Inst 1991 SSE2 : MOVDQA+PADDB xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1965. Inst 1992 SSE2 : MOVDQA+PADDW xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1966. Inst 1993 SSE2 : MOVDQA+PADDD xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1967. Inst 1994 SSE2 : MOVDQA+PADDQ xmm, xmm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1968. Inst 1995 AVX : VMOVDQA+VPADDB xm, xm, xm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1969. Inst 1996 AVX : VMOVDQA+VPADDW xm, xm, xm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1970. Inst 1997 AVX : VMOVDQA+VPADDD xm, xm, xm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1971. Inst 1998 AVX : VMOVDQA+VPADDQ xm, xm, xm L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1972. Inst 1999 AVX : VMOVAPS+VADDPS ym, ym, ym L: 7.16ns= 15.0c T: 0.48ns= 1.00c
  1973. Inst 2000 AVX : VMOVAPD+VADDPD ym, ym, ym L: 7.16ns= 15.0c T: 0.48ns= 1.00c
  1974. Inst 2004 BMI : ANDN r32, r32, r32 L: 0.72ns= 1.5c T: 0.20ns= 0.42c
  1975. Inst 2005 BMI : ANDN r64, r64, r64 L: 0.72ns= 1.5c T: 0.19ns= 0.40c
  1976. Inst 2006 BMI : BEXTR r32, r32, r32 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  1977. Inst 2007 BMI : BEXTR r64, r64, r64 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  1978. Inst 2008 BMI : BLSI r32, r32 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1979. Inst 2009 BMI : BLSI r64, r64 L: 1.43ns= 3.0c T: 0.36ns= 0.76c
  1980. Inst 2010 BMI : BLSMSK r32, r32 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  1981. Inst 2011 BMI : BLSMSK r64, r64 L: 1.43ns= 3.0c T: 0.36ns= 0.76c
  1982. Inst 2012 BMI : BLSR r32, r32 L: 1.43ns= 3.0c T: 0.36ns= 0.76c
  1983. Inst 2013 BMI : BLSR r64, r64 L: 1.43ns= 3.0c T: 0.36ns= 0.76c
  1984. Inst 2014 BMI : TZCNT r16, r16 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1985. Inst 2015 BMI : TZCNT r32, r32 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1986. Inst 2016 BMI : TZCNT r64, r64 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  1987. Inst 2017 BMI2 : BZHI r32, r32, r32 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  1988. Inst 2018 BMI2 : BZHI r64, r64, r64 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  1989. Inst 2019 BMI2 : MULX r32, r32, r32 L: 3.58ns= 7.5c T: 1.51ns= 3.17c
  1990. Inst 2020 BMI2 : MULX r64, r64, r64 L: 5.01ns= 10.5c T: 2.86ns= 6.00c
  1991. Inst 2021 BMI2 : PDEP r32, r32, r32 L: 14.67ns= 30.8c T: 14.67ns= 30.75c
  1992. Inst 2022 BMI2 : PDEP r64, r64, r64 L: 14.67ns= 30.8c T: 14.67ns= 30.75c
  1993. Inst 2023 BMI2 : PEXT r32, r32, r32 L: 14.67ns= 30.8c T: 14.67ns= 30.75c
  1994. Inst 2024 BMI2 : PEXT r64, r64, r64 L: 14.67ns= 30.8c T: 14.67ns= 30.75c
  1995. Inst 2025 BMI2 : RORX r32, r32, r32 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  1996. Inst 2026 BMI2 : RORX r64, r64, r64 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  1997. Inst 2027 BMI2 : SARX r32, r32, r32 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  1998. Inst 2028 BMI2 : SARX r64, r64, r64 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  1999. Inst 2029 BMI2 : SHLX r32, r32, r32 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  2000. Inst 2030 BMI2 : SHLX r64, r64, r64 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  2001. Inst 2031 BMI2 : SHRX r32, r32, r32 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  2002. Inst 2032 BMI2 : SHRX r64, r64, r64 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  2003. Inst 2033 TBM : BEXTR r32, r32, imm32 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  2004. Inst 2034 TBM : BEXTR r64, r64, imm64 L: 0.72ns= 1.5c T: 0.36ns= 0.75c
  2005. Inst 2035 TBM : BLCFILL r32, r32 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  2006. Inst 2036 TBM : BLCFILL r64, r64 L: 1.43ns= 3.0c T: 0.37ns= 0.78c
  2007. Inst 2037 TBM : BLCI r32, r32 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  2008. Inst 2038 TBM : BLCI r64, r64 L: 1.43ns= 3.0c T: 0.37ns= 0.78c
  2009. Inst 2039 TBM : BLCIC r32, r32 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  2010. Inst 2040 TBM : BLCIC r64, r64 L: 1.43ns= 3.0c T: 0.37ns= 0.78c
  2011. Inst 2041 TBM : BLCMSK r32, r32 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  2012. Inst 2042 TBM : BLCMSK r64, r64 L: 1.43ns= 3.0c T: 0.37ns= 0.78c
  2013. Inst 2043 TBM : BLCS r32, r32 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  2014. Inst 2044 TBM : BLCS r64, r64 L: 1.43ns= 3.0c T: 0.37ns= 0.78c
  2015. Inst 2045 TBM : BLSFILL r32, r32 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  2016. Inst 2046 TBM : BLSFILL r64, r64 L: 1.43ns= 3.0c T: 0.37ns= 0.78c
  2017. Inst 2047 TBM : BLSIC r32, r32 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  2018. Inst 2048 TBM : BLSIC r64, r64 L: 1.43ns= 3.0c T: 0.37ns= 0.78c
  2019. Inst 2049 TBM : T1MSKC r32, r32 L: 1.43ns= 3.0c T: 0.38ns= 0.79c
  2020. Inst 2050 TBM : T1MSKC r64, r64 L: 1.43ns= 3.0c T: 0.37ns= 0.78c
  2021. Inst 2051 TBM : TZMSK r32, r32 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  2022. Inst 2052 TBM : TZMSK r64, r64 L: 1.43ns= 3.0c T: 0.37ns= 0.78c
  2023. Inst 2053 AVX2 : VMOVNTDQA ymm, [m256] L: [memory dep.] T: 1.50ns= 1.50c
  2024. Inst 2054 AVX2 : VMOVNTDQA + VMOVNTDQ ymm L: 7.87ns= 16.5c T: 16.50ns= 16.50c
  2025. Inst 2055 AVX2 : VPMOVMSKB r32, ymm L: [diff. reg. set] T: 0.72ns= 1.50c
  2026. Inst 2056 AVX2 : VPMOVMSKB r64, ymm L: [diff. reg. set] T: 0.72ns= 1.50c
  2027. Inst 2057 AVX2 : VPADDB ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2028. Inst 2058 AVX2 : VPADDW ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2029. Inst 2059 AVX2 : VPADDD ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2030. Inst 2060 AVX2 : VPADDQ ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2031. Inst 2061 AVX2 : VPADDSB ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2032. Inst 2062 AVX2 : VPADDSW ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2033. Inst 2063 AVX2 : VPADDUSB ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2034. Inst 2064 AVX2 : VPADDUSW ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2035. Inst 2065 AVX2 : VPSUBB ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2036. Inst 2066 AVX2 : VPSUBB ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2037. Inst 2067 AVX2 : VPSUBW ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2038. Inst 2068 AVX2 : VPSUBW ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2039. Inst 2069 AVX2 : VPSUBD ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2040. Inst 2070 AVX2 : VPSUBD ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2041. Inst 2071 AVX2 : VPSUBQ ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2042. Inst 2072 AVX2 : VPSUBQ ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2043. Inst 2073 AVX2 : VPSUBSB ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2044. Inst 2074 AVX2 : VPSUBSB ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2045. Inst 2075 AVX2 : VPSUBSW ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2046. Inst 2076 AVX2 : VPSUBSW ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2047. Inst 2077 AVX2 : VPSUBUSB ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2048. Inst 2078 AVX2 : VPSUBUSB ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2049. Inst 2079 AVX2 : VPSUBUSW ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2050. Inst 2080 AVX2 : VPSUBUSW ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2051. Inst 2081 AVX2 : VPCMPEQB ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2052. Inst 2082 AVX2 : VPCMPEQB ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2053. Inst 2083 AVX2 : VPCMPEQW ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2054. Inst 2084 AVX2 : VPCMPEQW ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2055. Inst 2085 AVX2 : VPCMPEQD ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2056. Inst 2086 AVX2 : VPCMPEQD ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2057. Inst 2087 AVX2 : VPCMPEQQ ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2058. Inst 2088 AVX2 : VPCMPEQQ ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2059. Inst 2089 AVX2 : VPCMPGTB ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2060. Inst 2090 AVX2 : VPCMPGTB ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2061. Inst 2091 AVX2 : VPCMPGTW ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2062. Inst 2092 AVX2 : VPCMPGTW ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2063. Inst 2093 AVX2 : VPCMPGTD ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2064. Inst 2094 AVX2 : VPCMPGTD ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2065. Inst 2095 AVX2 : VPCMPGTQ ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2066. Inst 2096 AVX2 : VPCMPGTQ ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2067. Inst 2097 AVX2 : VPAND ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2068. Inst 2098 AVX2 : VPAND ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2069. Inst 2099 AVX2 : VPANDN ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2070. Inst 2100 AVX2 : VPANDN ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2071. Inst 2101 AVX2 : VPOR ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2072. Inst 2102 AVX2 : VPOR ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2073. Inst 2103 AVX2 : VPXOR ymm, ymm, ymm L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2074. Inst 2104 AVX2 : VPXOR ymm1, ymm1, ymm2 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2075. Inst 2105 AVX2 : VPMULHW ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2076. Inst 2106 AVX2 : VPMULHUW ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2077. Inst 2107 AVX2 : VPMULHRSW ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2078. Inst 2108 AVX2 : VPMULLW ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2079. Inst 2109 AVX2 : VPMULLD ymm, ymm, ymm L: 3.58ns= 7.5c T: 2.86ns= 6.00c
  2080. Inst 2110 AVX2 : VPMULDQ ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2081. Inst 2111 AVX2 : VPMULUDQ ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2082. Inst 2112 AVX2 : VPMADDUBSW ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2083. Inst 2113 AVX2 : VPMADDWD ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2084. Inst 2114 AVX2 : VPSLLW ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2085. Inst 2115 AVX2 : VPSLLW ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2086. Inst 2116 AVX2 : VPSLLD ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2087. Inst 2117 AVX2 : VPSLLD ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2088. Inst 2118 AVX2 : VPSLLQ ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2089. Inst 2119 AVX2 : VPSLLQ ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2090. Inst 2120 AVX2 : VPSLLDQ ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2091. Inst 2121 AVX2 : VPSRAW ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2092. Inst 2122 AVX2 : VPSRAW ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2093. Inst 2123 AVX2 : VPSRAD ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2094. Inst 2124 AVX2 : VPSRAD ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2095. Inst 2125 AVX2 : VPSRLW ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2096. Inst 2126 AVX2 : VPSRLW ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2097. Inst 2127 AVX2 : VPSRLD ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2098. Inst 2128 AVX2 : VPSRLD ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2099. Inst 2129 AVX2 : VPSRLQ ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2100. Inst 2130 AVX2 : VPSRLQ ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2101. Inst 2131 AVX2 : VPSRLDQ ymm, ymm, imm8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2102. Inst 2132 AVX2 : VPUNPCKHBW ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2103. Inst 2133 AVX2 : VPUNPCKHWD ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2104. Inst 2134 AVX2 : VPUNPCKHDQ ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2105. Inst 2135 AVX2 : VPUNPCKHQDQ ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2106. Inst 2136 AVX2 : VPUNPCKLBW ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2107. Inst 2137 AVX2 : VPUNPCKLWD ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2108. Inst 2138 AVX2 : VPUNPCKLDQ ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2109. Inst 2139 AVX2 : VPUNPCKLQDQ ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2110. Inst 2140 AVX2 : VPACKSSWB ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2111. Inst 2141 AVX2 : VPACKUSWB ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2112. Inst 2142 AVX2 : VPACKSSDW ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2113. Inst 2143 AVX2 : VPACKUSDW ymm, ymm, ymm L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2114. Inst 2144 AVX2 : VPAVGB ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2115. Inst 2145 AVX2 : VPAVGW ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2116. Inst 2146 AVX2 : VPMAXUB ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2117. Inst 2147 AVX2 : VPMAXSB ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2118. Inst 2148 AVX2 : VPMAXUW ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2119. Inst 2149 AVX2 : VPMAXSW ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2120. Inst 2150 AVX2 : VPMAXUD ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2121. Inst 2151 AVX2 : VPMAXSD ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2122. Inst 2152 AVX2 : VPMINUB ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2123. Inst 2153 AVX2 : VPMINSB ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2124. Inst 2154 AVX2 : VPMINUW ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2125. Inst 2155 AVX2 : VPMINSW ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2126. Inst 2156 AVX2 : VPMINUD ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2127. Inst 2157 AVX2 : VPMINSD ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2128. Inst 2158 AVX2 : VPSADBW ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2129. Inst 2159 AVX2 : VPSHUFB ymm, ymm, ymm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  2130. Inst 2160 AVX2 : VPSHUFLW ymm, ymm, im8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2131. Inst 2161 AVX2 : VPSHUFHW ymm, ymm, im8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2132. Inst 2162 AVX2 : VPSHUFD ymm, ymm, im8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2133. Inst 2163 AVX2 : VPABSB ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2134. Inst 2164 AVX2 : VPABSW ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2135. Inst 2165 AVX2 : VPABSD ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2136. Inst 2166 AVX2 : VPALIGNR ymm, ymm, ymm, im8 L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2137. Inst 2167 AVX2 : VPHADDW ymm, ymm, ymm L: 3.58ns= 7.5c T: 2.86ns= 6.00c
  2138. Inst 2168 AVX2 : VPHADDD ymm, ymm, ymm L: 3.58ns= 7.5c T: 2.86ns= 6.00c
  2139. Inst 2169 AVX2 : VPHADDSW ymm, ymm, ymm L: 3.58ns= 7.5c T: 2.86ns= 6.00c
  2140. Inst 2170 AVX2 : VPHSUBW ymm, ymm, ymm L: 3.58ns= 7.5c T: 2.86ns= 6.00c
  2141. Inst 2171 AVX2 : VPHSUBD ymm, ymm, ymm L: 3.58ns= 7.5c T: 2.86ns= 6.00c
  2142. Inst 2172 AVX2 : VPHSUBSW ymm, ymm, ymm L: 3.58ns= 7.5c T: 2.86ns= 6.00c
  2143. Inst 2173 AVX2 : VPSIGNB ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2144. Inst 2174 AVX2 : VPSIGNW ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2145. Inst 2175 AVX2 : VPSIGND ymm, ymm, ymm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2146. Inst 2176 AVX2 : VPBLENDW ymm, ymm, ymm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2147. Inst 2177 AVX2 : VPBLENDVB ymm, ymm, ymm, ym L: 1.43ns= 3.0c T: 1.43ns= 3.00c
  2148. Inst 2178 AVX2 : VPBLENDD xmm, xmm, xmm, im8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  2149. Inst 2179 AVX2 : VPBLENDD ymm, ymm, ymm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2150. Inst 2180 AVX2 : VMPSADBW ymm, ymm, imm8 L: 8.59ns= 18.0c T: 5.73ns= 12.00c
  2151. Inst 2181 AVX2 : VPMOVSXBW ymm, xmm L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  2152. Inst 2182 AVX2 : VPMOVSXBD ymm, xmm L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  2153. Inst 2183 AVX2 : VPMOVSXBQ ymm, xmm L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  2154. Inst 2184 AVX2 : VPMOVSXWD ymm, xmm L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  2155. Inst 2185 AVX2 : VPMOVSXWQ ymm, xmm L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  2156. Inst 2186 AVX2 : VPMOVSXDQ ymm, xmm L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  2157. Inst 2187 AVX2 : VPMOVZXBW ymm, xmm L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  2158. Inst 2188 AVX2 : VPMOVZXBD ymm, xmm L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  2159. Inst 2189 AVX2 : VPMOVZXBQ ymm, xmm L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  2160. Inst 2190 AVX2 : VPMOVZXWD ymm, xmm L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  2161. Inst 2191 AVX2 : VPMOVZXWQ ymm, xmm L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  2162. Inst 2192 AVX2 : VPMOVZXDQ ymm, xmm L: 2.15ns= 4.5c T: 2.15ns= 4.50c
  2163. Inst 2193 AVX2 : VPMASKMOVD xmm,xmm,[m128+4] L: [memory dep.] T: 0.72ns= 1.50c
  2164. Inst 2194 AVX2 : VPMASKMOVD [m128+4],xmm,xmm L: [memory dep.] T: 5.72ns= 12.00c
  2165. Inst 2195 AVX2 : VPMASKMOVD unaligned LSpair L: 19.56ns= 41.0c T: 5.09ns= 10.67c
  2166. Inst 2196 AVX2 : VPMASKMOVQ xmm,xmm,[m128+4] L: [memory dep.] T: 0.72ns= 1.50c
  2167. Inst 2197 AVX2 : VPMASKMOVQ [m128+4],xmm,xmm L: [memory dep.] T: 5.73ns= 12.00c
  2168. Inst 2198 AVX2 : VPMASKMOVQ unaligned LSpair L: 19.56ns= 41.0c T: 5.73ns= 12.00c
  2169. Inst 2199 AVX2 : VPMASKMOVD ymm,ymm,[m256+4] L: [memory dep.] T: 1.43ns= 3.00c
  2170. Inst 2200 AVX2 : VPMASKMOVD [m256+4],ymm,ymm L: [memory dep.] T: 11.45ns= 24.00c
  2171. Inst 2201 AVX2 : VPMASKMOVD unaligned LSpair L: 28.63ns= 60.0c T: 29.22ns= 61.25c
  2172. Inst 2202 AVX2 : VPMASKMOVQ ymm,ymm,[m256+4] L: [memory dep.] T: 1.43ns= 3.00c
  2173. Inst 2203 AVX2 : VPMASKMOVQ [m256+4],ymm,ymm L: [memory dep.] T: 11.45ns= 24.00c
  2174. Inst 2204 AVX2 : VPMASKMOVQ unaligned LSpair L: 28.63ns= 60.0c T: 29.18ns= 61.17c
  2175. Inst 2205 AVX2 : VBROADCASTSS xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2176. Inst 2206 AVX2 : VBROADCASTSS ymm, xmm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  2177. Inst 2207 AVX2 : VBROADCASTSD ymm, xmm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  2178. Inst 2208 AVX2 : VPBROADCASTB xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2179. Inst 2209 AVX2 : VPBROADCASTB ymm, xmm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  2180. Inst 2210 AVX2 : VPBROADCASTW xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2181. Inst 2211 AVX2 : VPBROADCASTW ymm, xmm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  2182. Inst 2212 AVX2 : VPBROADCASTD xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2183. Inst 2213 AVX2 : VPBROADCASTD ymm, xmm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  2184. Inst 2214 AVX2 : VPBROADCASTQ xmm, xmm L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2185. Inst 2215 AVX2 : VPBROADCASTQ ymm, xmm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  2186. Inst 2216 AVX2 : VBROADCASTI128 ymm, m128 L: [memory dep.] T: 0.36ns= 0.75c
  2187. Inst 2217 AVX2 : VEXTRACTI128 xmm, ymm, imm8 L: 1.43ns= 3.0c T: 0.36ns= 0.75c
  2188. Inst 2218 AVX2 : VINSERTI128 ym, ym, xm, im8 L: 1.43ns= 3.0c T: 0.72ns= 1.50c
  2189. Inst 2219 AVX2 : VPERM2I128 ym, ym, ym, im8 L: 3.58ns= 7.5c T: 2.23ns= 4.67c
  2190. Inst 2220 AVX2 : VPERMD ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2191. Inst 2221 AVX2 : VPERMQ ymm, ymm, imm8 L: 2.86ns= 6.0c T: 1.39ns= 2.92c
  2192. Inst 2222 AVX2 : VPERMPS ymm, ymm, ymm L: 2.86ns= 6.0c T: 1.43ns= 3.00c
  2193. Inst 2223 AVX2 : VPERMPD ymm, ymm, imm8 L: 2.86ns= 6.0c T: 1.35ns= 2.83c
  2194. Inst 2224 AVX2 : VPSLLVD xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  2195. Inst 2225 AVX2 : VPSLLVD ymm, ymm, ymm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  2196. Inst 2226 AVX2 : VPSLLVQ xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  2197. Inst 2227 AVX2 : VPSLLVQ ymm, ymm, ymm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  2198. Inst 2228 AVX2 : VPSRLVD xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  2199. Inst 2229 AVX2 : VPSRLVD ymm, ymm, ymm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  2200. Inst 2230 AVX2 : VPSRLVQ xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  2201. Inst 2231 AVX2 : VPSRLVQ ymm, ymm, ymm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  2202. Inst 2232 AVX2 : VPSRAVD xmm, xmm, xmm L: 2.15ns= 4.5c T: 0.72ns= 1.50c
  2203. Inst 2233 AVX2 : VPSRAVD ymm, ymm, ymm L: 2.15ns= 4.5c T: 1.43ns= 3.00c
  2204. Inst 2238 CLFLUSH: CLFLUSH [mem] L: [memory dep.] T: 148.85ns=312.00c
  2205. Inst 2248 X86 : MOV r1_8, r2_8 L: 0.72ns= 1.5c T: 0.72ns= 1.50c
  2206. Inst 2249 X86 : MOV r1_16, r2_16 L: 0.72ns= 1.5c T: 0.21ns= 0.44c
  2207. Inst 2250 X86 : MOV r1_32, r2_32 L: 0.18ns= 0.4c T: 0.18ns= 0.37c
  2208. Inst 2251 AMD64 : MOV r1_64, r2_64 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2209. Inst 2252 X86 : MOVSX r1_16, r2_8 L: 0.72ns= 1.5c T: 0.20ns= 0.41c
  2210. Inst 2253 X86 : MOVSX r1_32, r2_8 L: 0.19ns= 0.4c T: 0.19ns= 0.40c
  2211. Inst 2254 AMD64 : MOVSX r1_64, r2_8 L: 0.19ns= 0.4c T: 0.19ns= 0.40c
  2212. Inst 2255 X86 : MOVSX r1_32, r2_16 L: 0.19ns= 0.4c T: 0.19ns= 0.40c
  2213. Inst 2256 AMD64 : MOVSX r1_64, r2_16 L: 0.19ns= 0.4c T: 0.19ns= 0.40c
  2214. Inst 2257 AMD64 : MOVSXD r1_64, r2_32 L: 0.19ns= 0.4c T: 0.19ns= 0.40c
  2215. Inst 2258 X86 : MOVZX r1_16, r2_8 L: 0.72ns= 1.5c T: 0.20ns= 0.41c
  2216. Inst 2259 X86 : MOVZX r1_32, r2_8 L: 0.19ns= 0.4c T: 0.19ns= 0.40c
  2217. Inst 2260 AMD64 : MOVZX r1_64, r2_8 L: 0.19ns= 0.4c T: 0.19ns= 0.40c
  2218. Inst 2261 X86 : MOVZX r1_32, r2_16 L: 0.19ns= 0.4c T: 0.19ns= 0.40c
  2219. Inst 2262 AMD64 : MOVZX r1_64, r2_16 L: 0.19ns= 0.4c T: 0.19ns= 0.40c
  2220. Inst 2263 MMX : MOVQ mm1, mm2 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  2221. Inst 2264 SSE : MOVSS xmm1, xmm2 L: 1.43ns= 3.0c T: 0.60ns= 1.25c
  2222. Inst 2265 AVX : VMOVSS xmm1, xmm1, xmm2 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  2223. Inst 2266 SSE : MOVAPS xmm1, xmm2 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2224. Inst 2267 AVX : VMOVAPS xmm1, xmm2 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2225. Inst 2268 SSE : MOVUPS xmm1, xmm2 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2226. Inst 2269 AVX : VMOVUPS xmm1, xmm2 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2227. Inst 2270 SSE2 : MOVSD xmm1, xmm2 L: 1.43ns= 3.0c T: 0.60ns= 1.25c
  2228. Inst 2271 AVX : VMOVSD xmm1, xmm1, xmm2 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  2229. Inst 2272 SSE2 : MOVAPD xmm1, xmm2 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2230. Inst 2273 AVX : VMOVAPD xmm1, xmm2 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2231. Inst 2274 SSE2 : MOVUPD xmm1, xmm2 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2232. Inst 2275 AVX : VMOVUPD xmm1, xmm2 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2233. Inst 2276 SSE2 : MOVDQA xmm1, xmm2 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2234. Inst 2277 AVX : VMOVDQA xmm1, xmm2 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2235. Inst 2278 SSE2 : MOVDQU xmm1, xmm2 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2236. Inst 2279 AVX : VMOVDQU xmm1, xmm2 L: 0.18ns= 0.4c T: 0.18ns= 0.38c
  2237. Inst 2280 AVX : VMOVAPS ymm1, ymm2 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  2238. Inst 2281 AVX : VMOVUPS ymm1, ymm2 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  2239. Inst 2282 AVX : VMOVAPD ymm1, ymm2 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  2240. Inst 2283 AVX : VMOVUPD ymm1, ymm2 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  2241. Inst 2284 AVX : VMOVDQA ymm1, ymm2 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
  2242. Inst 2285 AVX : VMOVDQU ymm1, ymm2 L: 0.36ns= 0.8c T: 0.36ns= 0.75c
RAW Paste Data