Quote:
In this case each byte would take a couple of cycles to calculate the address and one cycle to read/write memory. So it could take almost 40 cycles to move all eight registers. I think this will add too much interrupt latency. So I'm going to use individual push/pop instructions for every register.
How fast is the clock? If it is fast enough, the greater latency may not be an issue. For instance, at 10 MHz the instruction can take 10x as many clocks and still have the same latency as at 1 MHz.
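
To put rough numbers on that, here is a quick sketch of the arithmetic. The 40-cycle figure is the estimate from the quote, and the 1 MHz and 10 MHz clocks are just the example rates above, not measurements from any real design. The helper name `latency_us` is made up for illustration.

```python
def latency_us(cycles: int, clock_mhz: float) -> float:
    """Wall-clock time in microseconds for `cycles` clock cycles at `clock_mhz` MHz.

    One cycle at f MHz lasts 1/f microseconds, so the total is cycles / f.
    """
    return cycles / clock_mhz


# Assumed figures: ~40 cycles to move all eight registers (from the quote).
print(latency_us(40, 1.0))    # 40 cycles at 1 MHz
print(latency_us(40, 10.0))   # the same 40 cycles at 10 MHz
print(latency_us(400, 10.0))  # 10x as many cycles at 10 MHz
```

The first and last lines come out the same, which is the point: what matters for interrupt latency is the time in microseconds, not the raw cycle count.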