318

Software Optimization Guide for AMD64 Processors

25112 Rev. 3.06 September 2005

Table 18. SSE Instructions (Continued)

		Encoding			Decode
Syntax						FPU pipe(s)	Latency	Note
	Prefix	First	2nd
				ModRM byte	type

	byte	byte	byte
	byte	byte	byte

COMISS xmmreg,	0Fh	2Fh		mm-xxx-xxx	VectorPath		6
mem32

CVTPI2PS xmmreg,	0Fh	2Ah		11-xxx-xxx	DirectPath		4
mmreg

CVTPI2PS xmmreg,	0Fh	2Ah		mm-xxx-xxx	DirectPath		6
mem64

CVTPS2PI mmreg,	0Fh	2Dh		11-xxx-xxx	DirectPath		4
xmmreg

CVTPS2PI mmreg,	0Fh	2Dh		mm-xxx-xxx	DirectPath		6
mem128

CVTSI2SS xmmreg,	F3h	0Fh	2Ah	11-xxx-xxx	VectorPath		14
reg32/64

CVTSI2SS xmmreg,	F3h	0Fh	2Ah	mm-xxx-xxx	Double		9
mem32/64

CVTSS2SI reg32,	F3h	0Fh	2Dh	11-xxx-xxx	Double		9
xmmreg

CVTSS2SI reg32,	F3h	0Fh	2Dh	mm-xxx-xxx	VectorPath		10
mem32

CVTTPS2PI mmreg,	0Fh	2Ch		11-xxx-xxx	DirectPath		4
xmmreg

CVTTPS2PI mmreg,	0Fh	2Ch		mm-xxx-xxx	DirectPath		6
mem128

CVTTSS2SI reg32,	F3h	0Fh	2Ch	11-xxx-xxx	Double		9
xmmreg

CVTTSS2SI reg32,	F3h	0Fh	2Ch	mm-xxx-xxx	VectorPath		10
mem32

DIVPS xmmreg1,	0Fh	5Eh		11-xxx-xxx	Double	FMUL	33
xmmreg2

Notes:

1. The low half of the result is available one cycle earlier than listed.

2. The second latency value indicates when the low half of the result becomes available.

3. The high half of the result is available one cycle earlier than listed.

4. The latency listed is the absolute minimum, while average latencies may be higher and are a function of internal pipeline conditions.

5. For the PREFETCHNTA/T0/T1/T2 instructions, the mem8 value refers to an address in the 64-byte line to be prefetched.

6. The 8-clock latency is only visible to younger stores that need to do an external write. The 2-clock latency is visible to the other stores and instructions.

7. This is the execution latency for the instruction. The time to complete the external write depends on the memory speed and the hardware implementation.

Instruction Latencies

Appendix C

AMD 250 manual 318

Models: 250

Software Optimization Guide for AMD64 Processors

Table 18. SSE Instructions (Continued)

FMUL

318