Difference between revisions of "Z80 Routines:Optimized:PopCountA"
From WikiTI
Line 6: | Line 6: | ||
<nowiki>;input: b | <nowiki>;input: b | ||
;output: a | ;output: a | ||
+ | ;destroys: f, c | ||
;27 bytes, 108 clock cycles | ;27 bytes, 108 clock cycles | ||
TypicalPopCountA: | TypicalPopCountA: | ||
Line 29: | Line 30: | ||
Better routine: | Better routine: | ||
<nowiki>;- Pop Count A | <nowiki>;- Pop Count A | ||
− | ;input: byte in | + | ;input: byte in a |
− | ;output: number of set bits in | + | ;output: number of set bits in a |
− | ;destroys | + | ;destroys f, bc |
;22 bytes and 85 clock cycles | ;22 bytes and 85 clock cycles | ||
;author: jacobly | ;author: jacobly |
Revision as of 19:36, 15 September 2020
This is a faster/smaller (and also obfuscated) replacement for the normal popcount a, which returns the number of set bits in a.
Typical routine:
;input: b ;output: a ;destroys: f, c ;27 bytes, 108 clock cycles TypicalPopCountA: xor a ld c, a rrc b adc a, c rrc b adc a, c rrc b adc a, c rrc b adc a, c rrc b adc a, c rrc b adc a, c rrc b adc a, c rrc b adc a, c
Better routine:
;- Pop Count A ;input: byte in a ;output: number of set bits in a ;destroys f, bc ;22 bytes and 85 clock cycles ;author: jacobly PopCountA: ld c, a ; c=(A|B|C|D|E|F|G|H) and 10101010b ; a=(A|0|C|0|E|0|G|0) cpl ; a=(~A|1|~C|1|~E|1|~G|1) rrca ; a=(1|~A|1|~C|1|~E|1|~G), cf=1 adc a, c ; a=(A+B|C+D|E+F|G+H) ld b, a ; b=(A+B|C+D|E+F|G+H) and 00110011b ; a=(00|C+D|00|G+H) ld c, a ; c=(00|C+D|00|G+H) xor b ; a=(A+B|00|E+F|00) rrca \ rrca ; a=(0|A+B|00|E+F|0) add a, c ; a=(A+B+C+D|E+F+G+H) ld c, a ; c=(A+B+C+D|E+F+G+H) rrca \ rrca \ rrca \ rrca ; a=(E+F+G+H|A+B+C+D) add a, c ; a=(A+B+C+D+E+F+G+H|A+B+C+D+E+F+G+H) and 00001111b ; a=A+B+C+D+E+F+G+H