minus-squarePapamousse@beehaw.orgtoOpen Source@lemmy.ml•FFmpeg devs boast of up to 94x performance boost after implementing handwritten AVX-512 assembly codelinkfedilinkarrow-up1·10 days agoI worked in the media broadcasting, we had an internal lib to scale/convert whatever format in real time, and it went from basic operation, to SSE3, to AVX512, to CUDA, and yes crafting some functions/loops wit assembly can give an enormous boost. linkfedilink
I worked in the media broadcasting, we had an internal lib to scale/convert whatever format in real time, and it went from basic operation, to SSE3, to AVX512, to CUDA, and yes crafting some functions/loops wit assembly can give an enormous boost.