Here is a fast multiplication routine. In its worst case, it runs through the loop 8 times. Multiply any h by any l and the result is returned in de; this might be fast enough to drive ray-tracing :-)
application/x-unknown-content-type-asm_auto_file; name="rpmult.asm"