Recently I've checked different methods to convert numbers to binary representation, including use of new PDEP instruction from BMI2 extension.
Today I've updated the article with new SWAR version 2, a tricky use of multiplication. The method is not faster, but I like the approach---in certain conditions multiplication can be seen as multi-shift/bit-or instruction. I've already use multiplication in this way to emulate instruction pmovmskb.
Subskrybuj:
Komentarze do posta (Atom)
Brak komentarzy:
Prześlij komentarz