Moreover, Intel 8080 and its derivatives, like Z80, where frequently much faster for floating-point operantions than 6502 derivatives, depending on their relative clock frequencies and on the speed of the memory.
The main reason was that 8080/Z80 could do much faster FP multiplications (which were extremely slow on 8-bit microprocessors, typically requiring many milliseconds or even tens of milliseconds per FP64 multiplication), because those could be implemented by using 16-bit additions and 16-bit shifts and accumulating partial results in registers (i.e. by using the 16-bit index operations and registers, not the 8-bit accumulator operations). 6502 had only 8-bit operations and too few registers to keep partial results in them, so the partial results had to be stored in memory.
Nevertheless, the great advantage of 8080/Z80 in computational speed was not always realized, because a lot of the 8080/Z80 programs used naive multiplication procedures that were an order of magnitude or more slower than the optimized multiplication.
Even in the Microsoft BASIC and in the run-time library of the Microsoft FORTRAN compiler for CP/M the floating-point operations did not have an optimum implementation, so after reverse-engineering them and replacing the core algorithms I could speed up a lot my programs in MS CP/M FORTRAN and BASIC.
The AMD floating-point peripherals for 8080/Z80, Am9511 and Am9512, which were second sourced by Intel as 8231 and 8232, were faster than achievable in software. They used a microprogrammed implementation, IIRC with a 16-bit ALU.
Am9511/Intel 8231 was not much faster than optimized software, but Am9512/Intel 8232 was much faster. However the latter was launched only in 1980, not much before Intel 8087 and after the first discussions for the standardization of the Intel 8087 FP formats, so Am9512/Intel 8232 was actually the first hardware FPU to implement them (i.e. the future IEEE 754-1985 standard). Since 8087 was available only for 8086/8088, 9512/8232 remained the solution available for 8080/Z80, but I doubt that many have used it, because whoever had money to pay for an expensive FPU would have been likely to also pay for a better 16-bit CPU, instead of staying with Z80.