> I tried to switch from 16 -> 8 bit
> mvd1616 goes from 283cycles -> 292cycles and mvload 108->114cycles
> mvd88 stays at 354
>
> i did not look at the gcc generated asm (did just eat)

Did you try timing the overall performance? I suspect cache effects are more important than the speed of local functions.

Dark Shikari