Nome e qualifica del proponente del progetto: 
sb_p_2258156
Anno: 
2020
Abstract: 

Convolutional computation kernels are fundamental to today¿s edge computing applications. Multi-threading processor cores are an interesting approach to pursue the highest energy efficiency and lowest hardware cost in edge computing systems, yet they need hardware acceleration schemes to deal with heavy computational workloads like convolutional algorithms. Following a vector approach to accelerate convolutions, this study explores possible alternatives to implement vector coprocessing units, showing the application-dependence of the optimal balance among the hardware architecture parameters.
A set of over 12 different coprocessor acceleration schemes will be designed, implemented in RTL VHDL/SystemVerilog code, synthesized on FPGA and compared with respect to several performance figures. The performance figures will include total clock cycle count and absolute time to execute the convolution kernels, hardware resource utilization and energy efficiency. The goal of the research is to demonstrate that pure data-level parallelism is not the most efficient way to achieve speed, and a hybrid combination of data level parallelism and thread level parallelism is likely to be the optimal choice.

ERC: 
PE7_5
PE6_2
PE6_1
Componenti gruppo di ricerca: 
sb_cp_is_2910263
sb_cp_es_392999
Innovatività: 

Being convolutions highly parallel in nature, as well as extremely important in a variety of edge computing applications, they serve as a prime example to demonstrate the exploitation of different hardware acceleration schemes (coprocessor schemes) in edge computing processor cores.
Our study is agnostic with respect to supply or bias voltage tuning, purely relying on data-level parallelist and thread-level-parallelism for energy efficiency, thus targeting any physical implementation including soft-cores on commercial FPGA devices, as shown in our results.
In all the presently published works the architectural concept is a subset of those covered in our study, and we aim at ahsowing that pure data-level-parallelism exploitation might not be the optimal approach to achieve the best performance and energy efficiency.

Codice Bando: 
2258156

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma