Advertisement
Guest User

Untitled

a guest
May 26th, 2019
100
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 2.41 KB | None | 0 0
  1. In the past months I have done a lot of benchmarking of dav1d, the AV1 video decoder developed by VideoLAN. Right now most functions are accelerated with hand-writen assembly code, SSSE3, AVX2 but also. I think that there are great marketing oppertunities for both Arm and dav1d, showing of how newer processors improve performance and lower power usage, and how NEON enables performance gains over 100%.
  2.  
  3. In my past testing: https://medium.com/@ewoutterhoeven/how-arms-neon-enables-efficient-av1-decoding-on-mobile-5fcb3a4f6e7f
  4.  
  5. More comparisons could be done, showing how more cores can not only increase performance but also lower power consumption. The same is probably through for newer cores. Furthermore comparisons between different compilers with different levels auto-vectorizations and the hand-written NEON-assembly could be done.
  6.  
  7. Due to the low power usage, low die area and therefore high core counts I believe Arm could be very competative in the video market. Results of tests would be shared and can be used both for marketing and research purposes.
  8.  
  9. ## Hardware ##
  10.  
  11. Arm64 in-order cores
  12. - Cortex-A53
  13. - Cortex-A55
  14.  
  15. Arm64 out-of-order cores
  16. - Cortex-A72
  17. - Cortex-A73
  18. - Cortex-A75
  19. - Cortex-A76
  20. - Cortex-A77
  21.  
  22. ## Tests ##
  23.  
  24. 1. Max. single-threaded performance
  25. dav1d's average frame rate with a single tile and frame thread.
  26.  
  27. KPI: Frames per second
  28.  
  29. 2. Max. multi-threaded performance
  30. dav1d's average frame rate with multiple tile and frame threads.
  31.  
  32. KPI: Frames per second
  33.  
  34. 3. NEON speedup
  35. The speed up from optimized C code to optimized C code with NEON assembly enabled.
  36.  
  37. KPI: Frames per second
  38.  
  39. 4. Power consumption at fixed frame rate:
  40. dav1d will be run at a fixed frame rate. Board power will be measured on the power adapter or over USB.
  41.  
  42. KPI: Watt (average)
  43.  
  44. 5. Power consumption with different thread counts
  45. dav1d will be run at a fixed frame rate
  46.  
  47. KPI: Watt (average)
  48.  
  49. 6. Compilers and autovecotrization performance
  50. dav1d will be run at a fixed frame rate compiled with different compilers (at least GCC and Clang) and different levels of autovectorization, and with and without the NEON assembly enabled.
  51.  
  52. KPI: Frames per second
  53.  
  54. 7. Compilers and autovecotrization power
  55. dav1d will be run at a fixed frame rate compiled with different compilers (at least GCC and Clang) and different levels of autovectorization, and with and without the NEON assembly enabled.
  56.  
  57. KPI: Watt (average)
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement