
Publication Information


Title
Japanese: 
English: Evaluating and Optimizing OpenCL Kernels for High Performance Computing with FPGAs
Authors
Japanese: ハミド レザ ゾフーリ, 丸山直也, Aaron Smith, Motohiko Matsuda, 松岡聡.
English: Hamid Reza Zohouri, Naoya Maruyama, Aaron Smith, Motohiko Matsuda, Satoshi Matsuoka.
Language English
Journal/Book title
Japanese: 
English: 
Volume, issue, pages
Publication date March 16, 2017
Publisher
Japanese: 
English: 
Conference name
Japanese: 
English: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'16)
Conference location
Japanese: 
English: Salt Lake City, UT
Official link https://ieeexplore.ieee.org/abstract/document/7877113/
 
DOI https://doi.org/10.1109/SC.2016.34
Abstract We evaluate the power and performance of the Rodinia benchmark suite using the Altera SDK for OpenCL targeting a Stratix V FPGA against a modern CPU and GPU. We study multiple OpenCL kernels per benchmark, ranging from direct ports of the original GPU implementations to loop-pipelined kernels specifically optimized for FPGAs. Based on our results, we find that even though OpenCL is functionally portable across devices, direct ports of GPU-optimized code do not perform well compared to kernels optimized with FPGA-specific techniques such as sliding windows. However, by exploiting FPGA-specific optimizations, it is possible to achieve up to 3.4x better power efficiency using an Altera Stratix V FPGA in comparison to an NVIDIA K20c GPU, and better run time and power efficiency in comparison to CPU. We also present preliminary results for Arria 10, which, due to hardened FPUs, exhibits noticeably better performance compared to Stratix V in floating-point-intensive benchmarks.
