{ VFMSUB213PS VFMSUB213PS vfmsub213psHFused Multiply-Subtract of Packed Single-Precision Floating-Point Values vfmsub213ps9H vfmsub213psH vfmsub213ps:H vfmsub213psH vfmsub213ps;H vfmsub213psH vfmsub213ps9H vfmsub213ps# vfmsub213psH vfmsub213ps/# vfmsub213ps:H vfmsub213ps# vfmsub213psH vfmsub213ps2# vfmsub213ps;H vfmsub213psH vfmsub213psQH vfmsub213psQHEhttps://www.felixcloutier.com/x86/vfmsub132ps:vfmsub213ps:vfmsub231psNEGNEGnegTwo's Complement NegationnegbNEGB negwNEGW neglNEGLnegbNEGB#negwNEGW$neglNEGL'%https://www.felixcloutier.com/x86/negVPERMT2BVPERMT2Bvpermt2b9Full Permute of Bytes From Two Tables Overwriting a Table vpermt2bTvpermt2b/Tvpermt2bTvpermt2b2Tvpermt2bTvpermt2b5Tvpermt2bTvpermt2b/Tvpermt2bTvpermt2b2Tvpermt2bTvpermt2b5T*https://www.felixcloutier.com/x86/vpermt2bPSUBSWPSUBSWpsubsw;Subtract Packed Signed Word Integers with Signed SaturationpsubswPSUBSW psubswPSUBSW+ psubswPSUBSWpsubswPSUBSW//https://www.felixcloutier.com/x86/psubsb:psubswVMOVWVMOVWvmovw Move WordvmovwRvmovwRvmovw$Rvmovw$R'https://www.felixcloutier.com/x86/vmovwVFMSUBADD132PSVFMSUBADD132PSvfmsubadd132psXFused Multiply-Alternating Subtract/Add of Packed Single-Precision Floating-Point Valuesvfmsubadd132ps9Hvfmsubadd132psHvfmsubadd132ps:Hvfmsubadd132psHvfmsubadd132ps;Hvfmsubadd132psHvfmsubadd132ps9Hvfmsubadd132ps#vfmsubadd132psHvfmsubadd132ps/#vfmsubadd132ps:Hvfmsubadd132ps#vfmsubadd132psHvfmsubadd132ps2#vfmsubadd132ps;Hvfmsubadd132psHvfmsubadd132psQHvfmsubadd132psQHNhttps://www.felixcloutier.com/x86/vfmsubadd132ps:vfmsubadd213ps:vfmsubadd231ps CVTTPD2PI CVTTPD2PI cvttpd2piRConvert with Truncation Packed Double-Precision FP Values to Packed Dword Integers cvttpd2pi CVTTPD2PL cvttpd2pi CVTTPD2PL/+https://www.felixcloutier.com/x86/cvttpd2piMINPSMINPSminps<Return Minimum Packed Single-Precision Floating-Point ValuesminpsMINPSminpsMINPS/'https://www.felixcloutier.com/x86/minpsVGF2P8AFFINEINVQBVGF2P8AFFINEINVQBvgf2p8affineinvqb0Galois Field (2^8) Affine Inverse Transformationvgf2p8affineinvqb=Kvgf2p8affineinvqbKvgf2p8affineinvqb?Kvgf2p8affineinvqbKvgf2p8affineinvqbAHvgf2p8affineinvqbHvgf2p8affineinvqb=Kvgf2p8affineinvqb vgf2p8affineinvqbKvgf2p8affineinvqb/ vgf2p8affineinvqb?Kvgf2p8affineinvqb vgf2p8affineinvqbKvgf2p8affineinvqb2 vgf2p8affineinvqbAHvgf2p8affineinvqbHVPSHAQVPSHAQvpshaq!Packed Shift Arithmetic Quadwordsvpshaq"vpshaq/"vpshaq/" VFNMADD132PH VFNMADD132PH vfnmadd132phJFused Negative Multiply-Add of Packed Half-Precision Floating-Point Values vfnmadd132ph<K vfnmadd132phK vfnmadd132ph>K vfnmadd132phK vfnmadd132ph@R vfnmadd132phR vfnmadd132ph<K vfnmadd132phK vfnmadd132ph>K vfnmadd132phK vfnmadd132ph@R vfnmadd132phR vfnmadd132phQR vfnmadd132phQRlhttps://www.felixcloutier.com/x86/vfmadd132ph:vfnmadd132ph:vfmadd213ph:vfnmadd213ph:vfmadd231ph:vfnmadd231phPAVGWPAVGWpavgwAverage Packed Word Integerspavgw pavgw+ pavgwpavgw/-https://www.felixcloutier.com/x86/pavgb:pavgwVPHADDDVPHADDDvphaddd(Packed Horizontal Add Doubleword Integervphaddd vphaddd/ vphaddd!vphaddd2!VPBLENDDVPBLENDDvpblenddBlend Packed Doublewordsvpblendd!vpblendd/!vpblendd!vpblendd2!*https://www.felixcloutier.com/x86/vpblendd VCVTPD2PH VCVTPD2PH vcvtpd2phLConvert Packed Double-Precision FP Values to Packed Half-Precision FP Values vcvtpd2phx=K vcvtpd2phy?K vcvtpd2phzAR vcvtpd2phxK vcvtpd2phyK vcvtpd2phzR vcvtpd2phx=K vcvtpd2phy?K vcvtpd2phzAR vcvtpd2phxK vcvtpd2phyK vcvtpd2phzR vcvtpd2phzQR vcvtpd2phzQR+https://www.felixcloutier.com/x86/vcvtpd2phPMULLDPMULLDpmulld?Multiply Packed Signed Doubleword Integers and Store Low Resultpmulldpmulld//https://www.felixcloutier.com/x86/pmulld:pmullq VCVTPS2UDQ VCVTPS2UDQ vcvtps2udqbConvert Packed Single-Precision Floating-Point Values to Packed Unsigned Doubleword Integer Values vcvtps2udq9H vcvtps2udq:H vcvtps2udq;H vcvtps2udqH vcvtps2udqH vcvtps2udqH vcvtps2udq9H vcvtps2udqH vcvtps2udq:H vcvtps2udqH vcvtps2udq;H vcvtps2udqH vcvtps2udqQH vcvtps2udqQH,https://www.felixcloutier.com/x86/vcvtps2udq VCVTTPH2UQQ VCVTTPH2UQQ vcvttph2uqqnConvert with Truncation Packed Half Precision Floating-Point Values to Packed Unsigned Quadword Integer Values vcvttph2uqq*K vcvttph2uqq.K vcvttph2uqq<R vcvttph2uqqK vcvttph2uqqK vcvttph2uqqR vcvttph2uqq*K vcvttph2uqqK vcvttph2uqq.K vcvttph2uqqK vcvttph2uqq<R vcvttph2uqqR vcvttph2uqqRR vcvttph2uqqRR-https://www.felixcloutier.com/x86/vcvttph2uqq VCVTTSD2USI VCVTTSD2USI vcvttsd2usiXConvert with Truncation Scalar Double-Precision Floating-Point Value to Unsigned Integer vcvttsd2usiH vcvttsd2usi+H vcvttsd2usiRH-https://www.felixcloutier.com/x86/vcvttsd2usi VPCOMPRESSB VPCOMPRESSB vpcompressbBStore Sparse Packed Byte Integer Values into Dense Memory/Register  vpcompressb0K vpcompressbK vpcompressb3K vpcompressbK vpcompressb6U vpcompressbU vpcompressbK vpcompressbK vpcompressbU vpcompressb/K vpcompressb2K vpcompressb5U8https://www.felixcloutier.com/x86/vpcompressb:vcompresswVPSHLDWVPSHLDWvpshldw3Concatenate and Shift Packed Word Data Left Logical vpshldwKvpshldw/KvpshldwKvpshldw2KvpshldwUvpshldw5UvpshldwKvpshldw/KvpshldwKvpshldw2KvpshldwUvpshldw5UHADDPSHADDPShaddpsPacked Single-FP Horizontal Addhaddpshaddps/(https://www.felixcloutier.com/x86/haddpsPHSUBSWPHSUBSWphsubswFPacked Horizontal Subtract Signed Word Integers with Signed Saturationphsubswphsubsw+phsubswphsubsw/)https://www.felixcloutier.com/x86/phsubswVMOVNTPDVMOVNTPDvmovntpdKStore Packed Double-Precision Floating-Point Values Using Non-Temporal Hintvmovntpd/ vmovntpd/Hvmovntpd2 vmovntpd2Hvmovntpd5HVBROADCASTI32X8VBROADCASTI32X8vbroadcasti32x8#Broadcast Eight Doubleword Elementsvbroadcasti32x82Jvbroadcasti32x82JCLWBCLWBclwbCache Line Write Backclwb#;&https://www.felixcloutier.com/x86/clwbPSRADPSRADpsrad-Shift Packed Doubleword Data Right Arithmeticpsrad psrad psrad+ psradpsradpsrad/3https://www.felixcloutier.com/x86/psraw:psrad:psraqTPAUSETPAUSEtpause Timed PAUSEtpauseG(https://www.felixcloutier.com/x86/tpause VFNMADD231SS VFNMADD231SS vfnmadd231ssLFused Negative Multiply-Add of Scalar Single-Precision Floating-Point Values vfnmadd231ssH vfnmadd231ss'H vfnmadd231ss# vfnmadd231ssH vfnmadd231ss'# vfnmadd231ss'H vfnmadd231ssQH vfnmadd231ssQHHhttps://www.felixcloutier.com/x86/vfnmadd132ss:vfnmadd213ss:vfnmadd231ssVPERMI2WVPERMI2Wvpermi2w;Full Permute of Words From Two Tables Overwriting the Index vpermi2wIvpermi2w/Ivpermi2wIvpermi2w2Ivpermi2wIvpermi2w5Ivpermi2wIvpermi2w/Ivpermi2wIvpermi2w2Ivpermi2wIvpermi2w5IPhttps://www.felixcloutier.com/x86/vpermi2w:vpermi2d:vpermi2q:vpermi2ps:vpermi2pdBLSMSKBLSMSKblsmskMask From Lowest Set Bitblsmskl4blsmskl'4(https://www.felixcloutier.com/x86/blsmskCMOVPECMOVPEcmovpeMove if parity even (PF == 1)cmovpew  cmovpew $cmovpelcmovpel' VPCMPISTRI VPCMPISTRI vpcmpistri4Packed Compare Implicit Length Strings, Return Index vpcmpistri  vpcmpistri/ PABSDPABSDpabsd,Packed Absolute Value of Doubleword Integerspabsdpabsd+pabsdpabsd/9https://www.felixcloutier.com/x86/pabsb:pabsw:pabsd:pabsqKNOTWKNOTWknotwNOT 16-bit Mask RegisterknotwH9https://www.felixcloutier.com/x86/knotw:knotb:knotq:knotd VFMSUB132PS VFMSUB132PS vfmsub132psHFused Multiply-Subtract of Packed Single-Precision Floating-Point Values vfmsub132ps9H vfmsub132psH vfmsub132ps:H vfmsub132psH vfmsub132ps;H vfmsub132psH vfmsub132ps9H vfmsub132ps# vfmsub132psH vfmsub132ps/# vfmsub132ps:H vfmsub132ps# vfmsub132psH vfmsub132ps2# vfmsub132ps;H vfmsub132psH vfmsub132psQH vfmsub132psQHEhttps://www.felixcloutier.com/x86/vfmsub132ps:vfmsub213ps:vfmsub231psLZCNTLZCNTlzcnt%Count the Number of Leading Zero Bitslzcntw  3lzcntw $3lzcntl3lzcntl'3'https://www.felixcloutier.com/x86/lzcnt VINSERTF128 VINSERTF128 vinsertf128#Insert Packed Floating-Point Values vinsertf128  vinsertf128/ ahttps://www.felixcloutier.com/x86/vinsertf128:vinsertf32x4:vinsertf64x2:vinsertf32x8:vinsertf64x4VMINPHVMINPHvminph:Return Minimum Packed Half-Precision Floating-Point Valuesvminph<KvminphKvminph>KvminphKvminph@RvminphRvminph<KvminphKvminph>KvminphKvminph@RvminphRvminphRRvminphRR(https://www.felixcloutier.com/x86/vminphVPAVGWVPAVGWvpavgwAverage Packed Word IntegersvpavgwIvpavgw/IvpavgwIvpavgw2IvpavgwIvpavgw5Ivpavgw vpavgwIvpavgw/ vpavgw/Ivpavgw!vpavgwIvpavgw2!vpavgw2IvpavgwIvpavgw5IVPMOVQ2MVPMOVQ2Mvpmovq2m7Move Signs of Packed Quadword Integers to Mask Registervpmovq2mJvpmovq2mJvpmovq2mJEhttps://www.felixcloutier.com/x86/vpmovb2m:vpmovw2m:vpmovd2m:vpmovq2mVPSIGNDVPSIGNDvpsignd"Packed Sign of Doubleword Integersvpsignd vpsignd/ vpsignd!vpsignd2!VAESIMCVAESIMCvaesimc+Perform the AES InvMixColumn Transformationvaesimc vaesimc/  VPDPWUSDS VPDPWUSDS vpdpwusdsXPacked Dot Product of Unsigned-by-Signed Word subvectors into Doubleword with Saturation vpdpwusdsY vpdpwusds/Y vpdpwusdsY vpdpwusds2YVPSRAVDVPSRAVDvpsravd6Variable Shift Packed Doubleword Data Right Arithmeticvpsravd9HvpsravdHvpsravd:HvpsravdHvpsravd;HvpsravdHvpsravd9Hvpsravd!vpsravdHvpsravd/!vpsravd:Hvpsravd!vpsravdHvpsravd2!vpsravd;HvpsravdH9https://www.felixcloutier.com/x86/vpsravw:vpsravd:vpsravq VCVTSI2SS VCVTSI2SS vcvtsi2ss9Convert Dword Integer to Scalar Single-Precision FP Value vcvtsi2ssl  vcvtsi2sslH vcvtsi2ssl'  vcvtsi2ssl'H vcvtsi2sslQHVPMOVSWBVPMOVSWBvpmovswbEDown Convert Packed Word Values to Byte Values with Signed Saturation vpmovswbIvpmovswb,IvpmovswbIvpmovswb0IvpmovswbIvpmovswb3IvpmovswbIvpmovswbIvpmovswbIvpmovswb+Ivpmovswb/Ivpmovswb2I<https://www.felixcloutier.com/x86/vpmovwb:vpmovswb:vpmovuswbSTDSTDstdSet Direction FlagstdSTD%https://www.felixcloutier.com/x86/stdMOVSXMOVSXmovsxMove with Sign-ExtensionmovsbwMOVBWSX  movsbwMOVBWSX #movsblMOVBLSX movswlMOVWLSX movsblMOVBLSX#movswlMOVWLSX$.https://www.felixcloutier.com/x86/movsx:movsxdFEMMSFEMMSfemmsFast Exit Multimedia StatefemmsPFADDPFADDpfaddPacked Floating-Point Addpfaddpfadd+VFMADDSUB132PHVFMADDSUB132PHvfmaddsub132phVFused Multiply-Alternating Add/Subtract of Packed Half-Precision Floating-Point Valuesvfmaddsub132ph<Kvfmaddsub132phKvfmaddsub132ph>Kvfmaddsub132phKvfmaddsub132ph@Rvfmaddsub132phRvfmaddsub132ph<Kvfmaddsub132phKvfmaddsub132ph>Kvfmaddsub132phKvfmaddsub132ph@Rvfmaddsub132phRvfmaddsub132phQRvfmaddsub132phQRNhttps://www.felixcloutier.com/x86/vfmaddsub132ph:vfmaddsub213ph:vfmaddsub231phBLCMSKBLCMSKblcmskMask From Lowest Clear Bitblcmsk6blcmsk'6PFRCPPFRCPpfrcp.Packed Floating-Point Reciprocal Approximationpfrcppfrcp+LDMXCSRLDMXCSRldmxcsrLoad MXCSR Registerldmxcsr')https://www.felixcloutier.com/x86/ldmxcsr VFMADD231PS VFMADD231PS vfmadd231psCFused Multiply-Add of Packed Single-Precision Floating-Point Values vfmadd231ps9H vfmadd231psH vfmadd231ps:H vfmadd231psH vfmadd231ps;H vfmadd231psH vfmadd231ps9H vfmadd231ps# vfmadd231psH vfmadd231ps/# vfmadd231ps:H vfmadd231ps# vfmadd231psH vfmadd231ps2# vfmadd231ps;H vfmadd231psH vfmadd231psQH vfmadd231psQHEhttps://www.felixcloutier.com/x86/vfmadd132ps:vfmadd213ps:vfmadd231psMOVQ2DQMOVQ2DQmovq2dq1Move Quadword from MMX Technology to XMM Registermovq2dq)https://www.felixcloutier.com/x86/movq2dq VFNMSUB231PD VFNMSUB231PD vfnmsub231pdQFused Negative Multiply-Subtract of Packed Double-Precision Floating-Point Values vfnmsub231pd=H vfnmsub231pdH vfnmsub231pd?H vfnmsub231pdH vfnmsub231pdAH vfnmsub231pdH vfnmsub231pd=H vfnmsub231pd# vfnmsub231pdH vfnmsub231pd/# vfnmsub231pd?H vfnmsub231pd# vfnmsub231pdH vfnmsub231pd2# vfnmsub231pdAH vfnmsub231pdH vfnmsub231pdQH vfnmsub231pdQHHhttps://www.felixcloutier.com/x86/vfnmsub132pd:vfnmsub213pd:vfnmsub231pdPXORPXORpxor#Packed Bitwise Logical Exclusive ORpxorPXOR pxorPXOR+ pxorPXORpxorPXOR/&https://www.felixcloutier.com/x86/pxor SERIALIZE SERIALIZE serializeSerialize Instruction Execution serializeF+https://www.felixcloutier.com/x86/serialize VCVTSS2USI VCVTSS2USI vcvtss2usiSConvert Scalar Single-Precision Floating-Point Value to Unsigned Doubleword Integer vcvtss2usiH vcvtss2usi'H vcvtss2usiQH,https://www.felixcloutier.com/x86/vcvtss2usi VCVTTPS2DQ VCVTTPS2DQ vcvttps2dqRConvert with Truncation Packed Single-Precision FP Values to Packed Dword Integers vcvttps2dq9H vcvttps2dq:H vcvttps2dq;H vcvttps2dqH vcvttps2dqH vcvttps2dqH vcvttps2dq9H vcvttps2dq  vcvttps2dqH vcvttps2dq/  vcvttps2dq:H vcvttps2dq  vcvttps2dqH vcvttps2dq2  vcvttps2dq;H vcvttps2dqH vcvttps2dqRH vcvttps2dqRHVFMSUBADD213PDVFMSUBADD213PDvfmsubadd213pdXFused Multiply-Alternating Subtract/Add of Packed Double-Precision Floating-Point Valuesvfmsubadd213pd=Hvfmsubadd213pdHvfmsubadd213pd?Hvfmsubadd213pdHvfmsubadd213pdAHvfmsubadd213pdHvfmsubadd213pd=Hvfmsubadd213pd#vfmsubadd213pdHvfmsubadd213pd/#vfmsubadd213pd?Hvfmsubadd213pd#vfmsubadd213pdHvfmsubadd213pd2#vfmsubadd213pdAHvfmsubadd213pdHvfmsubadd213pdQHvfmsubadd213pdQHNhttps://www.felixcloutier.com/x86/vfmsubadd132pd:vfmsubadd213pd:vfmsubadd231pdVFRCZSDVFRCZSDvfrczsd7Extract Fraction Scalar Double-Precision Floating-Pointvfrczsd"vfrczsd+"PSUBWPSUBWpsubwSubtract Packed Word IntegerspsubwPSUBW psubwPSUBW+ psubwPSUBWpsubwPSUBW/3https://www.felixcloutier.com/x86/psubb:psubw:psubdJSJSjsJump if sign (SF == 1)jsJMINjsJMIOTZMSKTZMSKtzmskMask From Trailing Zerostzmsk6tzmsk'6 VFNMSUB231SD VFNMSUB231SD vfnmsub231sdQFused Negative Multiply-Subtract of Scalar Double-Precision Floating-Point Values vfnmsub231sdH vfnmsub231sd+H vfnmsub231sd# vfnmsub231sdH vfnmsub231sd+# vfnmsub231sd+H vfnmsub231sdQH vfnmsub231sdQHHhttps://www.felixcloutier.com/x86/vfnmsub132sd:vfnmsub213sd:vfnmsub231sdVPANDDVPANDDvpandd1Bitwise Logical AND of Packed Doubleword Integers vpandd9HvpanddHvpandd:HvpanddHvpandd;HvpanddHvpandd9HvpanddHvpandd:HvpanddHvpandd;HvpanddHVPANDNVPANDNvpandnPacked Bitwise Logical AND NOTvpandn vpandn/ vpandn!vpandn2! VPGATHERDD VPGATHERDD vpgatherdd?Gather Packed Doubleword Values Using Signed Doubleword Indices vpgatherddBH vpgatherddFH vpgatherddJH vpgatherddB! vpgatherddF!7https://www.felixcloutier.com/x86/vpgatherdd:vpgatherdqVAESKEYGENASSISTVAESKEYGENASSISTvaeskeygenassistAES Round Key Generation Assistvaeskeygenassist vaeskeygenassist/  VFNMADDPD VFNMADDPD vfnmaddpdLFused Negative Multiply-Add of Packed Double-Precision Floating-Point Values vfnmaddpd$ vfnmaddpd/$ vfnmaddpd/$ vfnmaddpd$ vfnmaddpd2$ vfnmaddpd2$ VPGATHERDQ VPGATHERDQ vpgatherdq=Gather Packed Quadword Values Using Signed Doubleword Indices vpgatherdqBH vpgatherdqBH vpgatherdqFH vpgatherdqB! vpgatherdqB!7https://www.felixcloutier.com/x86/vpgatherdq:vpgatherqqKSHIFTRWKSHIFTRWkshiftrwShift Right 16-bit MaskskshiftrwHEhttps://www.felixcloutier.com/x86/kshiftrw:kshiftrb:kshiftrq:kshiftrd VCVTUSI2SH VCVTUSI2SH vcvtusi2shFConvert Unsigned Integer to Scalar Half-Precision Floating-Point Value vcvtusi2shlR vcvtusi2shl'R vcvtusi2shlQR,https://www.felixcloutier.com/x86/vcvtusi2shVPHSUBWVPHSUBWvphsubw(Packed Horizontal Subtract Word Integersvphsubw vphsubw/ vphsubw!vphsubw2!VPPERMVPPERMvppermPacked Permute Bytesvpperm"vpperm/"vpperm/" VCVTSD2SI VCVTSD2SI vcvtsd2si3Convert Scalar Double-Precision FP Value to Integer vcvtsd2si  vcvtsd2siH vcvtsd2si+  vcvtsd2si+H vcvtsd2siQHVPMAXSQVPMAXSQvpmaxsq*Maximum of Packed Signed Quadword Integers vpmaxsq=HvpmaxsqHvpmaxsq?HvpmaxsqHvpmaxsqAHvpmaxsqHvpmaxsq=HvpmaxsqHvpmaxsq?HvpmaxsqHvpmaxsqAHvpmaxsqHVPSHRDQVPSHRDQvpshrdq8Concatenate and Shift Packed Quadword Data Right Logical vpshrdq=KvpshrdqKvpshrdq?KvpshrdqKvpshrdqAUvpshrdqUvpshrdq=KvpshrdqKvpshrdq?KvpshrdqKvpshrdqAUvpshrdqU VCVTSD2SH VCVTSD2SH vcvtsd2shJConvert Scalar Double-Precision FP Value to Scalar Half-Precision FP Value vcvtsd2shR vcvtsd2sh+R vcvtsd2shR vcvtsd2sh+R vcvtsd2shQR vcvtsd2shQR+https://www.felixcloutier.com/x86/vcvtsd2shCVTDQ2PSCVTDQ2PScvtdq2psBConvert Packed Dword Integers to Packed Single-Precision FP Valuescvtdq2pscvtdq2ps/*https://www.felixcloutier.com/x86/cvtdq2psSUBPDSUBPDsubpd6Subtract Packed Double-Precision Floating-Point ValuessubpdSUBPDsubpdSUBPD/'https://www.felixcloutier.com/x86/subpd VSHA512MSG2 VSHA512MSG2 vsha512msg2FPerform a Final Calculation for the Next Four SHA512 Message Quadwords vsha512msg2) VSHA512RNDS2 VSHA512RNDS2 vsha512rnds2&Perform Two Rounds of SHA512 Operation vsha512rnds2) CVTTSS2SI CVTTSS2SI cvttss2siIConvert with Truncation Scalar Single-Precision FP Value to Dword Integer cvttss2si CVTTSS2SL cvttss2si CVTTSS2SL'+https://www.felixcloutier.com/x86/cvttss2siVMOVHPSVMOVHPSvmovhps7Move High Packed Single-Precision Floating-Point Valuesvmovhps+ vmovhps+Hvmovhps+ vmovhps+HPEXTRBPEXTRBpextrb Extract Bytepextrbpextrb#6https://www.felixcloutier.com/x86/pextrb:pextrd:pextrqANDNPSANDNPSandnpsHBitwise Logical AND NOT of Packed Single-Precision Floating-Point ValuesandnpsANDNPSandnpsANDNPS/(https://www.felixcloutier.com/x86/andnpsMOVDQUMOVDQUmovdquMove Unaligned Double QuadwordmovdquMOVOUmovdquMOVOU/movdquMOVOU/Ohttps://www.felixcloutier.com/x86/movdqu:vmovdqu8:vmovdqu16:vmovdqu32:vmovdqu64SUBPSSUBPSsubps6Subtract Packed Single-Precision Floating-Point ValuessubpsSUBPSsubpsSUBPS/'https://www.felixcloutier.com/x86/subps VFNMADD213SS VFNMADD213SS vfnmadd213ssLFused Negative Multiply-Add of Scalar Single-Precision Floating-Point Values vfnmadd213ssH vfnmadd213ss'H vfnmadd213ss# vfnmadd213ssH vfnmadd213ss'# vfnmadd213ss'H vfnmadd213ssQH vfnmadd213ssQHHhttps://www.felixcloutier.com/x86/vfnmadd132ss:vfnmadd213ss:vfnmadd231ssVSUBSDVSUBSDvsubsd6Subtract Scalar Double-Precision Floating-Point ValuesvsubsdHvsubsd+Hvsubsd vsubsdHvsubsd+ vsubsd+HvsubsdQHvsubsdQHVPSRAWVPSRAWvpsraw'Shift Packed Word Data Right ArithmeticvpsrawIvpsrawIvpsraw/IvpsrawIvpsrawIvpsraw/IvpsrawIvpsrawIvpsraw/Ivpsraw/Ivpsraw2Ivpsraw5Ivpsraw vpsrawIvpsraw vpsrawIvpsraw/ vpsraw/Ivpsraw/Ivpsraw!vpsrawIvpsraw!vpsrawIvpsraw/!vpsraw/Ivpsraw2IvpsrawIvpsrawIvpsraw/Ivpsraw5IVSCATTERPF0QPDVSCATTERPF0QPDvscatterpf0qpd‚Sparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Quadword Indices Using T0 Hint with Intent to Writevscatterpf0qpdML]https://www.felixcloutier.com/x86/vscatterpf0dps:vscatterpf0qps:vscatterpf0dpd:vscatterpf0qpd VSCALEFSH VSCALEFSH vscalefsh[Scale Scalar Half-Precision Floating-Point Value With a Half-Precision Floating-Point Value vscalefshR vscalefsh$R vscalefshR vscalefsh$R vscalefshQR vscalefshQR+https://www.felixcloutier.com/x86/vscalefsh VCVTTPH2QQ VCVTTPH2QQ vcvttph2qqlConvert with Truncation Packed Half Precision Floating-Point Values to Packed Singed Quadword Integer Values vcvttph2qq*K vcvttph2qq.K vcvttph2qq<R vcvttph2qqK vcvttph2qqK vcvttph2qqR vcvttph2qq*K vcvttph2qqK vcvttph2qq.K vcvttph2qqK vcvttph2qq<R vcvttph2qqR vcvttph2qqRR vcvttph2qqRR,https://www.felixcloutier.com/x86/vcvttph2qqSETCSETCsetcSet byte if carry (CF == 1)setcSETCS setcSETCS#VMOVUPSVMOVUPSvmovups<Move Unaligned Packed Single-Precision Floating-Point Valuesvmovups0HvmovupsHvmovups3HvmovupsHvmovups6HvmovupsHvmovups/Hvmovups2Hvmovups5Hvmovups vmovupsHvmovups/ vmovups/Hvmovups vmovupsHvmovups2 vmovups2HvmovupsHvmovups5Hvmovups/ vmovups/Hvmovups2 vmovups2Hvmovups5HVPCMPEQBVPCMPEQBvpcmpeqb%Compare Packed Byte Data for EqualityvpcmpeqbIvpcmpeqbIvpcmpeqb/Ivpcmpeqb/IvpcmpeqbIvpcmpeqbIvpcmpeqb2Ivpcmpeqb2IvpcmpeqbIvpcmpeqbIvpcmpeqb5Ivpcmpeqb5Ivpcmpeqb vpcmpeqb/ vpcmpeqb!vpcmpeqb2!VPCMPQVPCMPQvpcmpq%Compare Packed Signed Quadword Values vpcmpq=Hvpcmpq=HvpcmpqHvpcmpqHvpcmpq?Hvpcmpq?HvpcmpqHvpcmpqHvpcmpqAHvpcmpqAHvpcmpqHvpcmpqH0https://www.felixcloutier.com/x86/vpcmpq:vpcmpuqCMOVLCMOVLcmovlMove if less (SF != OF)cmovlw  cmovlw $cmovllcmovll'BLCSBLCSblcsSet Lowest Clear Bitblcs6blcs'6VPHSUBDVPHSUBDvphsubd.Packed Horizontal Subtract Doubleword Integersvphsubd vphsubd/ vphsubd!vphsubd2! VPMACSSDQL VPMACSSDQL vpmacssdqlSPacked Multiply Accumulate with Saturation Signed Low Doubleword to Signed Quadword vpmacssdql" vpmacssdql/"VPADDQVPADDQvpaddqAdd Packed Quadword Integersvpaddq=HvpaddqHvpaddq?HvpaddqHvpaddqAHvpaddqHvpaddq=Hvpaddq vpaddqHvpaddq/ vpaddq?Hvpaddq!vpaddqHvpaddq2!vpaddqAHvpaddqHVSM3MSG1VSM3MSG1vsm3msg1?Perform Initial Calculation for the Next Four SM3 Message Wordsvsm3msg1vsm3msg1/PFRSQRTPFRSQRTpfrsqrt:Packed Floating-Point Reciprocal Square Root Approximationpfrsqrtpfrsqrt+VMINSSVMINSSvminss;Return Minimum Scalar Single-Precision Floating-Point ValuevminssHvminss'Hvminss vminssHvminss' vminss'HvminssRHvminssRH VINSERTF64X2 VINSERTF64X2 vinsertf64x2@Insert 128 Bits of Packed Double-Precision Floating-Point Values vinsertf64x2J vinsertf64x2/J vinsertf64x2J vinsertf64x2/J vinsertf64x2J vinsertf64x2/J vinsertf64x2J vinsertf64x2/JCLFLUSHCLFLUSHclflushFlush Cache Lineclflush#9)https://www.felixcloutier.com/x86/clflush VCOMPRESSPS VCOMPRESSPS vcompresspsUStore Sparse Packed Single-Precision Floating-Point Values into Dense Memory/Register  vcompresspsH vcompressps0H vcompresspsH vcompressps3H vcompresspsH vcompressps6H vcompresspsH vcompresspsH vcompresspsH vcompressps/H vcompressps2H vcompressps5H-https://www.felixcloutier.com/x86/vcompressps VPDPWUUDS VPDPWUUDS vpdpwuudsZPacked Dot Product of Unsigned-by-Unsigned Word subvectors into Doubleword with Saturation vpdpwuudsY vpdpwuuds/Y vpdpwuudsY vpdpwuuds2YVPORDVPORDvpord0Bitwise Logical OR of Packed Doubleword Integers vpord9HvpordHvpord:HvpordHvpord;HvpordHvpord9HvpordHvpord:HvpordHvpord;HvpordHPSUBQPSUBQpsubq!Subtract Packed Quadword IntegerspsubqPSUBQpsubqPSUBQ+psubqPSUBQpsubqPSUBQ/'https://www.felixcloutier.com/x86/psubq VUNPCKHPS VUNPCKHPS vunpckhpsHUnpack and Interleave High Packed Single-Precision Floating-Point Values vunpckhps9H vunpckhpsH vunpckhps:H vunpckhpsH vunpckhps;H vunpckhpsH vunpckhps9H vunpckhps  vunpckhpsH vunpckhps/  vunpckhps:H vunpckhps  vunpckhpsH vunpckhps2  vunpckhps;H vunpckhpsHKANDNWKANDNWkandnw$Bitwise Logical AND NOT 16-bit MaskskandnwH=https://www.felixcloutier.com/x86/kandnw:kandnb:kandnq:kandnd VINSERTF64X4 VINSERTF64X4 vinsertf64x4@Insert 256 Bits of Packed Double-Precision Floating-Point Values vinsertf64x4H vinsertf64x42H vinsertf64x4H vinsertf64x42HXADDXADDxaddExchange and AddxaddbXADDB  xaddwXADDW  xaddlXADDLxaddbXADDB# xaddwXADDW$ xaddlXADDL'&https://www.felixcloutier.com/x86/xaddVMULPSVMULPSvmulps6Multiply Packed Single-Precision Floating-Point Valuesvmulps9HvmulpsHvmulps:HvmulpsHvmulps;HvmulpsHvmulps9Hvmulps vmulpsHvmulps/ vmulps:Hvmulps vmulpsHvmulps2 vmulps;HvmulpsHvmulpsQHvmulpsQHSETNBESETNBEsetnbe4Set byte if not below or equal (CF == 0 and ZF == 0)setnbeSETHI setnbeSETHI# VPMACSSDQH VPMACSSDQH vpmacssdqhTPacked Multiply Accumulate with Saturation Signed High Doubleword to Signed Quadword vpmacssdqh" vpmacssdqh/"VPINSRBVPINSRBvpinsrb Insert Bytevpinsrb vpinsrbIvpinsrb# vpinsrb#IVPLZCNTQVPLZCNTQvplzcntq@Count the Number of Leading Zero Bits for Packed Quadword Values vplzcntq=Nvplzcntq?NvplzcntqANvplzcntqNvplzcntqNvplzcntqNvplzcntq=NvplzcntqNvplzcntq?NvplzcntqNvplzcntqANvplzcntqN3https://www.felixcloutier.com/x86/vplzcntd:vplzcntq VFMSUB132PH VFMSUB132PH vfmsub132phFFused Multiply-Subtract of Packed Half-Precision Floating-Point Values vfmsub132ph<K vfmsub132phK vfmsub132ph>K vfmsub132phK vfmsub132ph@R vfmsub132phR vfmsub132ph<K vfmsub132phK vfmsub132ph>K vfmsub132phK vfmsub132ph@R vfmsub132phR vfmsub132phQR vfmsub132phQRlhttps://www.felixcloutier.com/x86/vfmsub132ph:vfnmsub132ph:vfmsub213ph:vfnmsub213ph:vfmsub231ph:vfnmsub231ph VGATHERPF0DPD VGATHERPF0DPD vgatherpf0dpdoSparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Doubleword Indices Using T0 Hint vgatherpf0dpdGLYhttps://www.felixcloutier.com/x86/vgatherpf0dps:vgatherpf0qps:vgatherpf0dpd:vgatherpf0qpd VFMSUB231PH VFMSUB231PH vfmsub231phFFused Multiply-Subtract of Packed Half-Precision Floating-Point Values vfmsub231ph<K vfmsub231phK vfmsub231ph>K vfmsub231phK vfmsub231ph@R vfmsub231phR vfmsub231ph<K vfmsub231phK vfmsub231ph>K vfmsub231phK vfmsub231ph@R vfmsub231phR vfmsub231phQR vfmsub231phQRlhttps://www.felixcloutier.com/x86/vfmsub132ph:vfnmsub132ph:vfmsub213ph:vfnmsub213ph:vfmsub231ph:vfnmsub231phPMOVSXDQPMOVSXDQpmovsxdqHMove Packed Doubleword Integers to Quadword Integers with Sign Extensionpmovsxdqpmovsxdq+ PREFETCHT0 PREFETCHT0 prefetcht0'Prefetch Data Into Caches using T0 Hint prefetcht0 PREFETCHT0# PABSWPABSWpabsw&Packed Absolute Value of Word Integerspabswpabsw+pabswpabsw/9https://www.felixcloutier.com/x86/pabsb:pabsw:pabsd:pabsqPFRCPIT1PFRCPIT1pfrcpit1,Packed Floating-Point Reciprocal Iteration 1pfrcpit1pfrcpit1+VPSHRDVQVPSHRDVQvpshrdvqAConcatenate and Variable Shift Packed Quadword Data Right Logical vpshrdvq=KvpshrdvqKvpshrdvq?KvpshrdvqKvpshrdvqAUvpshrdvqUvpshrdvq=KvpshrdvqKvpshrdvq?KvpshrdvqKvpshrdvqAUvpshrdvqUTESTTESTtestLogical ComparetestbTESTBtestbTESTB testbTESTB  testwTESTW testwTESTW testwTESTW  testlTESTLtestlTESTLtestlTESTLtestbTESTB#testbTESTB# testwTESTW$testwTESTW$ testlTESTL'testlTESTL'&https://www.felixcloutier.com/x86/test VPSHUFBITQMB VPSHUFBITQMB vpshufbitqmb@Shuffle Bits From Quadword Elements Using Byte Indexes Into Mask  vpshufbitqmbK vpshufbitqmbK vpshufbitqmb/K vpshufbitqmb/K vpshufbitqmbK vpshufbitqmbK vpshufbitqmb2K vpshufbitqmb2K vpshufbitqmbS vpshufbitqmbS vpshufbitqmb5S vpshufbitqmb5S.https://www.felixcloutier.com/x86/vpshufbitqmbVPCMPUBVPCMPUBvpcmpub#Compare Packed Unsigned Byte Values vpcmpubIvpcmpubIvpcmpub/Ivpcmpub/IvpcmpubIvpcmpubIvpcmpub2Ivpcmpub2IvpcmpubIvpcmpubIvpcmpub5Ivpcmpub5I0https://www.felixcloutier.com/x86/vpcmpb:vpcmpubVEXP2PSVEXP2PSvexp2psyApproximation to the Exponential 2^x of Packed Single-Precision Floating-Point Values with Less Than 2^-23 Relative Errorvexp2ps;Mvexp2psMvexp2ps;Mvexp2psMvexp2psRMvexp2psRM)https://www.felixcloutier.com/x86/vexp2ps VPERMT2PD VPERMT2PD vpermt2pdZFull Permute of Double-Precision Floating-Point Values From Two Tables Overwriting a Table  vpermt2pd=H vpermt2pdH vpermt2pd?H vpermt2pdH vpermt2pdAH vpermt2pdH vpermt2pd=H vpermt2pdH vpermt2pd?H vpermt2pdH vpermt2pdAH vpermt2pdHPhttps://www.felixcloutier.com/x86/vpermt2w:vpermt2d:vpermt2q:vpermt2ps:vpermt2pdPADDDPADDDpadddAdd Packed Doubleword Integerspaddd paddd+ padddpaddd/9https://www.felixcloutier.com/x86/paddb:paddw:paddd:paddqIMULIMULimulSigned MultiplyimulbIMULB imulwIMULW imullIMULLimulbIMULB#imulwIMULW$imullIMULL'imulwIMULW  imulwIMULW $imullIMULLimullIMULL'imulw  imulw  imulw $imulw $imullimullimull'imull'&https://www.felixcloutier.com/x86/imul VEXTRACTI32X8 VEXTRACTI32X8 vextracti32x84Extract 256 Bits of Packed Doubleword Integer Values vextracti32x8J vextracti32x83J vextracti32x8J vextracti32x82JSETNGESETNGEsetnge+Set byte if not greater or equal (SF != OF)setngeSETLT setngeSETLT# VMASKMOVDQU VMASKMOVDQU vmaskmovdqu'Store Selected Bytes of Double Quadword vmaskmovdqu VMOVSHVMOVSHvmovsh0Move Scalar Half-Precision Floating-Point Valuesvmovsh%Rvmovsh$Rvmovsh$Rvmovsh$RvmovshRvmovshR(https://www.felixcloutier.com/x86/vmovsh VGATHERPF1QPD VGATHERPF1QPD vgatherpf1qpdmSparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Quadword Indices Using T1 Hint vgatherpf1qpdMLYhttps://www.felixcloutier.com/x86/vgatherpf1dps:vgatherpf1qps:vgatherpf1dpd:vgatherpf1qpdVPADDUSWVPADDUSWvpaddusw:Add Packed Unsigned Word Integers with Unsigned SaturationvpadduswIvpaddusw/IvpadduswIvpaddusw2IvpadduswIvpaddusw5Ivpaddusw vpadduswIvpaddusw/ vpaddusw/Ivpaddusw!vpadduswIvpaddusw2!vpaddusw2IvpadduswIvpaddusw5IVPMAXUQVPMAXUQvpmaxuq,Maximum of Packed Unsigned Quadword Integers vpmaxuq=HvpmaxuqHvpmaxuq?HvpmaxuqHvpmaxuqAHvpmaxuqHvpmaxuq=HvpmaxuqHvpmaxuq?HvpmaxuqHvpmaxuqAHvpmaxuqH VPMULHRSW VPMULHRSW vpmulhrswOPacked Multiply Signed Word Integers and Store High Result with Round and Scale vpmulhrswI vpmulhrsw/I vpmulhrswI vpmulhrsw2I vpmulhrswI vpmulhrsw5I vpmulhrsw  vpmulhrswI vpmulhrsw/  vpmulhrsw/I vpmulhrsw! vpmulhrswI vpmulhrsw2! vpmulhrsw2I vpmulhrswI vpmulhrsw5IJPJPjpJump if parity (PF == 1)jpJPSNjpJPSOPADDBPADDBpaddbAdd Packed Byte Integerspaddb paddb+ paddbpaddb/9https://www.felixcloutier.com/x86/paddb:paddw:paddd:paddqAORAORaor Atomically ORaor'ENDBR32ENDBR32endbr32%END (terminate) BRanch in 32-bit modeendbr32 )https://www.felixcloutier.com/x86/endbr32VPADDDVPADDDvpadddAdd Packed Doubleword Integersvpaddd9HvpadddHvpaddd:HvpadddHvpaddd;HvpadddHvpaddd9Hvpaddd vpadddHvpaddd/ vpaddd:Hvpaddd!vpadddHvpaddd2!vpaddd;HvpadddHMULPDMULPDmulpd6Multiply Packed Double-Precision Floating-Point ValuesmulpdMULPDmulpdMULPD/'https://www.felixcloutier.com/x86/mulpdVFMADDSUB213PHVFMADDSUB213PHvfmaddsub213phVFused Multiply-Alternating Add/Subtract of Packed Half-Precision Floating-Point Valuesvfmaddsub213ph<Kvfmaddsub213phKvfmaddsub213ph>Kvfmaddsub213phKvfmaddsub213ph@Rvfmaddsub213phRvfmaddsub213ph<Kvfmaddsub213phKvfmaddsub213ph>Kvfmaddsub213phKvfmaddsub213ph@Rvfmaddsub213phRvfmaddsub213phQRvfmaddsub213phQRNhttps://www.felixcloutier.com/x86/vfmaddsub132ph:vfmaddsub213ph:vfmaddsub231ph VPERM2F128 VPERM2F128 vperm2f128Permute Floating-Point Values vperm2f128  vperm2f1282 ,https://www.felixcloutier.com/x86/vperm2f128VPROTBVPROTBvprotbPacked Rotate Bytesvprotb"vprotb"vprotb/"vprotb/"vprotb/"VPSRADVPSRADvpsrad-Shift Packed Doubleword Data Right Arithmeticvpsrad9Hvpsrad:Hvpsrad;HvpsradHvpsradHvpsrad/HvpsradHvpsradHvpsrad/HvpsradHvpsradHvpsrad/Hvpsrad9Hvpsrad vpsradHvpsrad vpsradHvpsrad/ vpsrad/Hvpsrad:Hvpsrad!vpsradHvpsrad!vpsradHvpsrad/!vpsrad/Hvpsrad;HvpsradHvpsradHvpsrad/HADDSUBPDADDSUBPDaddsubpdPacked Double-FP Add/Subtractaddsubpdaddsubpd/*https://www.felixcloutier.com/x86/addsubpd VCVTTPD2DQ VCVTTPD2DQ vcvttpd2dqRConvert with Truncation Packed Double-Precision FP Values to Packed Dword Integers vcvttpd2dqx=H vcvttpd2dqy?H vcvttpd2dqAH vcvttpd2dqxH vcvttpd2dqyH vcvttpd2dqH vcvttpd2dqx=H vcvttpd2dqy?H vcvttpd2dqx  vcvttpd2dqxH vcvttpd2dqy  vcvttpd2dqyH vcvttpd2dqx/  vcvttpd2dqy2  vcvttpd2dqAH vcvttpd2dqH vcvttpd2dqRH vcvttpd2dqRH VMOVNTDQA VMOVNTDQA vmovntdqa.Load Double Quadword Non-Temporal Aligned Hint vmovntdqa/  vmovntdqa/H vmovntdqa2! vmovntdqa2H vmovntdqa5HCALLCALLcallCall ProcedurecallCALLOcalllcalll'&https://www.felixcloutier.com/x86/callCDQCDQcdqConvert Doubleword to Quadwordcltd-https://www.felixcloutier.com/x86/cwd:cdq:cqoMONITORMONITORmonitorMonitor a Linear Address RangemonitorD)https://www.felixcloutier.com/x86/monitor VFNMADD213PH VFNMADD213PH vfnmadd213phJFused Negative Multiply-Add of Packed Half-Precision Floating-Point Values vfnmadd213ph<K vfnmadd213phK vfnmadd213ph>K vfnmadd213phK vfnmadd213ph@R vfnmadd213phR vfnmadd213ph<K vfnmadd213phK vfnmadd213ph>K vfnmadd213phK vfnmadd213ph@R vfnmadd213phR vfnmadd213phQR vfnmadd213phQRlhttps://www.felixcloutier.com/x86/vfmadd132ph:vfnmadd132ph:vfmadd213ph:vfnmadd213ph:vfmadd231ph:vfnmadd231phVPSHAWVPSHAWvpshawPacked Shift Arithmetic Wordsvpshaw"vpshaw/"vpshaw/"PMULHRWPMULHRWpmulhrw!Packed Multiply High Rounded Wordpmulhrwpmulhrw+PADDSWPADDSWpaddsw6Add Packed Signed Word Integers with Signed Saturationpaddsw paddsw+ paddswpaddsw//https://www.felixcloutier.com/x86/paddsb:paddsw VFMSUB132SH VFMSUB132SH vfmsub132shFFused Multiply-Subtract of Scalar Half-Precision Floating-Point Values vfmsub132shR vfmsub132sh$R vfmsub132shR vfmsub132sh$R vfmsub132shQR vfmsub132shQRlhttps://www.felixcloutier.com/x86/vfmsub132sh:vfnmsub132sh:vfmsub213sh:vfnmsub213sh:vfmsub231sh:vfnmsub231shVSCATTERPF1DPSVSCATTERPF1DPSvscatterpf1dps„Sparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Doubleword Indices Using T1 Hint with Intent to Writevscatterpf1dpsKL]https://www.felixcloutier.com/x86/vscatterpf1dps:vscatterpf1qps:vscatterpf1dpd:vscatterpf1qpd VGETMANTPD VGETMANTPD vgetmantpdOExtract Normalized Mantissas from Packed Double-Precision Floating-Point Values vgetmantpd=H vgetmantpd?H vgetmantpdAH vgetmantpdH vgetmantpdH vgetmantpdH vgetmantpd=H vgetmantpdH vgetmantpd?H vgetmantpdH vgetmantpdAH vgetmantpdH vgetmantpdRH vgetmantpdRH,https://www.felixcloutier.com/x86/vgetmantpdVPEXTRDVPEXTRDvpextrdExtract Doublewordvpextrd vpextrdJvpextrd' vpextrd'J VPHADDUWQ VPHADDUWQ vphadduwq/Packed Horizontal Add Unsigned Word to Quadword vphadduwq" vphadduwq/"PSRAWPSRAWpsraw'Shift Packed Word Data Right Arithmeticpsraw psraw psraw+ psrawpsrawpsraw/3https://www.felixcloutier.com/x86/psraw:psrad:psraqVDIVPHVDIVPHvdivph2Divide Packed Half-Precision Floating-Point Valuesvdivph<KvdivphKvdivph>KvdivphKvdivph@RvdivphRvdivph<KvdivphKvdivph>KvdivphKvdivph@RvdivphRvdivphQRvdivphQR(https://www.felixcloutier.com/x86/vdivph VFNMADD231PS VFNMADD231PS vfnmadd231psLFused Negative Multiply-Add of Packed Single-Precision Floating-Point Values vfnmadd231ps9H vfnmadd231psH vfnmadd231ps:H vfnmadd231psH vfnmadd231ps;H vfnmadd231psH vfnmadd231ps9H vfnmadd231ps# vfnmadd231psH vfnmadd231ps/# vfnmadd231ps:H vfnmadd231ps# vfnmadd231psH vfnmadd231ps2# vfnmadd231ps;H vfnmadd231psH vfnmadd231psQH vfnmadd231psQHHhttps://www.felixcloutier.com/x86/vfnmadd132ps:vfnmadd213ps:vfnmadd231psVPCOMQVPCOMQvpcomq'Compare Packed Signed Quadword Integersvpcomq"vpcomq/"VPMOVD2MVPMOVD2Mvpmovd2m9Move Signs of Packed Doubleword Integers to Mask Registervpmovd2mJvpmovd2mJvpmovd2mJEhttps://www.felixcloutier.com/x86/vpmovb2m:vpmovw2m:vpmovd2m:vpmovq2mVSQRTSHVSQRTSHvsqrtshACompute Square Root of Scalar Half-Precision Floating-Point ValuevsqrtshRvsqrtsh$RvsqrtshRvsqrtsh$RvsqrtshQRvsqrtshQR)https://www.felixcloutier.com/x86/vsqrtsh VSCATTERQPD VSCATTERQPD vscatterqpdRScatter Packed Double-Precision Floating-Point Values with Signed Quadword Indices vscatterqpdEH vscatterqpdIH vscatterqpdMHQhttps://www.felixcloutier.com/x86/vscatterdps:vscatterdpd:vscatterqps:vscatterqpdXLATBXLATBxlatbTable Look-up TranslationxlatXLAT,https://www.felixcloutier.com/x86/xlat:xlatbAANDAANDaandAtomically ANDaand'VSUBPHVSUBPHvsubph4Subtract Packed Half-Precision Floating-Point Valuesvsubph<KvsubphKvsubph>KvsubphKvsubph@RvsubphRvsubph<KvsubphKvsubph>KvsubphKvsubph@RvsubphRvsubphQRvsubphQR(https://www.felixcloutier.com/x86/vsubphPACKSSDWPACKSSDWpackssdw2Pack Doublewords into Words with Signed Saturationpackssdw packssdw+ packssdwpackssdw/3https://www.felixcloutier.com/x86/packsswb:packssdw VFNMSUB132PH VFNMSUB132PH vfnmsub132phOFused Negative Multiply-Subtract of Packed Half-Precision Floating-Point Values vfnmsub132ph<K vfnmsub132phK vfnmsub132ph>K vfnmsub132phK vfnmsub132ph@R vfnmsub132phR vfnmsub132ph<K vfnmsub132phK vfnmsub132ph>K vfnmsub132phK vfnmsub132ph@R vfnmsub132phR vfnmsub132phQR vfnmsub132phQRlhttps://www.felixcloutier.com/x86/vfmsub132ph:vfnmsub132ph:vfmsub213ph:vfnmsub213ph:vfmsub231ph:vfnmsub231phXORPDXORPDxorpd>Bitwise Logical XOR for Double-Precision Floating-Point ValuesxorpdXORPDxorpdXORPD/'https://www.felixcloutier.com/x86/xorpdMCOMMITMCOMMITmcommit Memory COMMITmcommit> VEXTRACTI64X2 VEXTRACTI64X2 vextracti64x22Extract 128 Bits of Packed Quadword Integer Values vextracti64x2J vextracti64x20J vextracti64x2J vextracti64x20J vextracti64x2J vextracti64x2J vextracti64x2/J vextracti64x2/JVFMSUBSDVFMSUBSDvfmsubsdHFused Multiply-Subtract of Scalar Double-Precision Floating-Point Valuesvfmsubsd$vfmsubsd+$vfmsubsd+$KSHIFTRQKSHIFTRQkshiftrqShift Right 64-bit MaskskshiftrqIEhttps://www.felixcloutier.com/x86/kshiftrw:kshiftrb:kshiftrq:kshiftrd PREFETCHNTA PREFETCHNTA prefetchnta(Prefetch Data Into Caches using NTA Hint prefetchnta PREFETCHNTA# SETNPSETNPsetnp Set byte if not parity (PF == 0)setnpSETPC setnpSETPC#VPSHLDVQVPSHLDVQvpshldvq@Concatenate and Variable Shift Packed Quadword Data Left Logical vpshldvq=KvpshldvqKvpshldvq?KvpshldvqKvpshldvqAUvpshldvqUvpshldvq=KvpshldvqKvpshldvq?KvpshldvqKvpshldvqAUvpshldvqUVPABSBVPABSBvpabsb&Packed Absolute Value of Byte IntegersvpabsbIvpabsbIvpabsbIvpabsb/Ivpabsb2Ivpabsb5Ivpabsb vpabsbIvpabsb/ vpabsb/Ivpabsb!vpabsbIvpabsb2!vpabsb2IvpabsbIvpabsb5I VGATHERDPD VGATHERDPD vgatherdpdTGather Packed Double-Precision Floating-Point Values Using Signed Doubleword Indices vgatherdpdBH vgatherdpdBH vgatherdpdFH vgatherdpdB! vgatherdpdB!7https://www.felixcloutier.com/x86/vgatherdps:vgatherdpdVPERMBVPERMBvpermbPermute Byte Integers vpermbTvpermb/TvpermbTvpermb2TvpermbTvpermb5TvpermbTvpermb/TvpermbTvpermb2TvpermbTvpermb5T(https://www.felixcloutier.com/x86/vpermbVBROADCASTF32X8VBROADCASTF32X8vbroadcastf32x88Broadcast Eight Single-Precision Floating-Point Elementsvbroadcastf32x82Jvbroadcastf32x82JRCPSSRCPSSrcpssOCompute Approximate Reciprocal of Scalar Single-Precision Floating-Point ValuesrcpssRCPSSrcpssRCPSS''https://www.felixcloutier.com/x86/rcpss VAESENCLAST VAESENCLAST vaesenclast,Perform Last Round of an AES Encryption Flow  vaesenclast  vaesenclastK vaesenclast/  vaesenclast/K vaesenclast vaesenclastK vaesenclast2 vaesenclast2K vaesenclastH vaesenclast5HVLDDQUVLDDQUvlddquLoad Unaligned Integer 128 Bitsvlddqu/ vlddqu2 VPALIGNRVPALIGNRvpalignrPacked Align RightvpalignrIvpalignr/IvpalignrIvpalignr2IvpalignrIvpalignr5Ivpalignr vpalignrIvpalignr/ vpalignr/Ivpalignr!vpalignrIvpalignr2!vpalignr2IvpalignrIvpalignr5I VPMOVUSQB VPMOVUSQB vpmovusqbKDown Convert Packed Quadword Values to Byte Values with Unsigned Saturation  vpmovusqbH vpmovusqb%H vpmovusqbH vpmovusqb(H vpmovusqbH vpmovusqb,H vpmovusqbH vpmovusqbH vpmovusqbH vpmovusqb$H vpmovusqb'H vpmovusqb+H<https://www.felixcloutier.com/x86/vpmovqb:vpmovsqb:vpmovusqbHSUBPDHSUBPDhsubpd$Packed Double-FP Horizontal Subtracthsubpdhsubpd/(https://www.felixcloutier.com/x86/hsubpdVMOVHLPSVMOVHLPSvmovhlps>Move Packed Single-Precision Floating-Point Values High to Lowvmovhlps vmovhlpsH VPTESTNMQ VPTESTNMQ vptestnmq;Logical NAND of Packed Quadword Integer Values and Set Mask  vptestnmq=H vptestnmq=H vptestnmqH vptestnmqH vptestnmq?H vptestnmq?H vptestnmqH vptestnmqH vptestnmqAH vptestnmqAH vptestnmqH vptestnmqHIhttps://www.felixcloutier.com/x86/vptestnmb:vptestnmw:vptestnmd:vptestnmqCMOVBCMOVBcmovbMove if below (CF == 1)cmovbw  cmovbw $cmovblcmovbl'VPROLDVPROLDvproldRotate Packed Doubleword Left vprold9Hvprold:Hvprold;HvproldHvproldHvproldHvprold9HvproldHvprold:HvproldHvprold;HvproldH?https://www.felixcloutier.com/x86/vprold:vprolvd:vprolq:vprolvqCLZEROCLZEROclzeroZero-out 64-bit Cache Lineclzero? VADDSUBPS VADDSUBPS vaddsubpsPacked Single-FP Add/Subtract vaddsubps  vaddsubps/  vaddsubps  vaddsubps2 VPHSUBWDVPHSUBWDvphsubwd;Packed Horizontal Subtract Signed Word to Signed Doublewordvphsubwd"vphsubwd/"VPOPCNTQVPOPCNTQvpopcntq-Packed Population Count for Quadword Integers vpopcntq=Kvpopcntq?KvpopcntqAPvpopcntqKvpopcntqKvpopcntqPvpopcntq=KvpopcntqKvpopcntq?KvpopcntqKvpopcntqAPvpopcntqPKNOTDKNOTDknotdNOT 32-bit Mask RegisterknotdI9https://www.felixcloutier.com/x86/knotw:knotb:knotq:knotdPADDWPADDWpaddwAdd Packed Word Integerspaddw paddw+ paddwpaddw/9https://www.felixcloutier.com/x86/paddb:paddw:paddd:paddq VPMOVMSKB VPMOVMSKB vpmovmskbMove Byte Mask vpmovmskb  vpmovmskb! VFNMSUBSS VFNMSUBSS vfnmsubssQFused Negative Multiply-Subtract of Scalar Single-Precision Floating-Point Values vfnmsubss$ vfnmsubss'$ vfnmsubss'$PFCMPGTPFCMPGTpfcmpgt.Packed Floating-Point Compare for Greater Thanpfcmpgtpfcmpgt+ VFMSUB231SD VFMSUB231SD vfmsub231sdHFused Multiply-Subtract of Scalar Double-Precision Floating-Point Values vfmsub231sdH vfmsub231sd+H vfmsub231sd# vfmsub231sdH vfmsub231sd+# vfmsub231sd+H vfmsub231sdQH vfmsub231sdQHEhttps://www.felixcloutier.com/x86/vfmsub132sd:vfmsub213sd:vfmsub231sdRDRANDRDRANDrdrandRead Random Numberrdrand *rdrand*(https://www.felixcloutier.com/x86/rdrand VGETMANTSS VGETMANTSS vgetmantssMExtract Normalized Mantissa from Scalar Single-Precision Floating-Point Value vgetmantssH vgetmantss'H vgetmantssH vgetmantss'H vgetmantssRH vgetmantssRH,https://www.felixcloutier.com/x86/vgetmantssVPCMPEQDVPCMPEQDvpcmpeqd+Compare Packed Doubleword Data for Equalityvpcmpeqd9Hvpcmpeqd9HvpcmpeqdHvpcmpeqdHvpcmpeqd:Hvpcmpeqd:HvpcmpeqdHvpcmpeqdHvpcmpeqd;Hvpcmpeqd;HvpcmpeqdHvpcmpeqdHvpcmpeqd vpcmpeqd/ vpcmpeqd!vpcmpeqd2! VPTESTNMD VPTESTNMD vptestnmd=Logical NAND of Packed Doubleword Integer Values and Set Mask  vptestnmd9H vptestnmd9H vptestnmdH vptestnmdH vptestnmd:H vptestnmd:H vptestnmdH vptestnmdH vptestnmd;H vptestnmd;H vptestnmdH vptestnmdHIhttps://www.felixcloutier.com/x86/vptestnmb:vptestnmw:vptestnmd:vptestnmqKADDWKADDWkaddwADD Two 16-bit MaskskaddwJ9https://www.felixcloutier.com/x86/kaddw:kaddb:kaddq:kadddJNBEJNBEjnbe0Jump if not below or equal (CF == 0 and ZF == 0)jnbeJHINjnbeJHIONOPNOPnop No OperationnopNOP%https://www.felixcloutier.com/x86/nop VCVTSH2SI VCVTSH2SI vcvtsh2si7Convert Scalar Half-Precision FP Value to Dword Integer vcvtsh2siR vcvtsh2si$R vcvtsh2siQR+https://www.felixcloutier.com/x86/vcvtsh2si VGETEXPPH VGETEXPPH vgetexpphhExtract Exponents of Packed Half-Precision Floating-Point Values as Half-Precision Floating-Point Values vgetexpph<K vgetexpph>K vgetexpph@R vgetexpphK vgetexpphK vgetexpphR vgetexpph<K vgetexpphK vgetexpph>K vgetexpphK vgetexpph@R vgetexpphR vgetexpphRR vgetexpphRR+https://www.felixcloutier.com/x86/vgetexpphVFMSUBADD231PSVFMSUBADD231PSvfmsubadd231psXFused Multiply-Alternating Subtract/Add of Packed Single-Precision Floating-Point Valuesvfmsubadd231ps9Hvfmsubadd231psHvfmsubadd231ps:Hvfmsubadd231psHvfmsubadd231ps;Hvfmsubadd231psHvfmsubadd231ps9Hvfmsubadd231ps#vfmsubadd231psHvfmsubadd231ps/#vfmsubadd231ps:Hvfmsubadd231ps#vfmsubadd231psHvfmsubadd231ps2#vfmsubadd231ps;Hvfmsubadd231psHvfmsubadd231psQHvfmsubadd231psQHNhttps://www.felixcloutier.com/x86/vfmsubadd132ps:vfmsubadd213ps:vfmsubadd231ps PUNPCKLWD PUNPCKLWD punpcklwd6Unpack and Interleave Low-Order Words into Doublewords punpcklwd  punpcklwd'  punpcklwd punpcklwd/Jhttps://www.felixcloutier.com/x86/punpcklbw:punpcklwd:punpckldq:punpcklqdqVBLENDPSVBLENDPSvblendps4 Blend Packed Single Precision Floating-Point Valuesvblendps vblendps/ vblendps vblendps2  VRNDSCALESS VRNDSCALESS vrndscaless]Round Scalar Single-Precision Floating-Point Value To Include A Given Number Of Fraction Bits vrndscalessH vrndscaless'H vrndscalessH vrndscaless'H vrndscalessRH vrndscalessRH-https://www.felixcloutier.com/x86/vrndscalessPINSRBPINSRBpinsrb Insert Bytepinsrbpinsrb#6https://www.felixcloutier.com/x86/pinsrb:pinsrd:pinsrq VFMADD231PH VFMADD231PH vfmadd231phAFused Multiply-Add of Packed Half-Precision Floating-Point Values vfmadd231ph<K vfmadd231phK vfmadd231ph>K vfmadd231phK vfmadd231ph@R vfmadd231phR vfmadd231ph<K vfmadd231phK vfmadd231ph>K vfmadd231phK vfmadd231ph@R vfmadd231phR vfmadd231phQR vfmadd231phQRlhttps://www.felixcloutier.com/x86/vfmadd132ph:vfnmadd132ph:vfmadd213ph:vfnmadd213ph:vfmadd231ph:vfnmadd231ph VCVTPD2UQQ VCVTPD2UQQ vcvtpd2uqqZConvert Packed Double-Precision Floating-Point Values to Packed Unsigned Quadword Integers vcvtpd2uqq=J vcvtpd2uqq?J vcvtpd2uqqAJ vcvtpd2uqqJ vcvtpd2uqqJ vcvtpd2uqqJ vcvtpd2uqq=J vcvtpd2uqqJ vcvtpd2uqq?J vcvtpd2uqqJ vcvtpd2uqqAJ vcvtpd2uqqJ vcvtpd2uqqQJ vcvtpd2uqqQJ,https://www.felixcloutier.com/x86/vcvtpd2uqq VPCMPESTRM VPCMPESTRM vpcmpestrm3Packed Compare Explicit Length Strings, Return Mask vpcmpestrml  vpcmpestrml/ RSQRTPSRSQRTPSrsqrtpsTCompute Reciprocals of Square Roots of Packed Single-Precision Floating-Point ValuesrsqrtpsRSQRTPSrsqrtpsRSQRTPS/)https://www.felixcloutier.com/x86/rsqrtps PCMPISTRM PCMPISTRM pcmpistrm3Packed Compare Implicit Length Strings, Return Mask pcmpistrm pcmpistrm/+https://www.felixcloutier.com/x86/pcmpistrmBLENDPDBLENDPDblendpd3Blend Packed Double Precision Floating-Point Valuesblendpdblendpd/)https://www.felixcloutier.com/x86/blendpdMOVDDUPMOVDDUPmovddup Move One Double-FP and Duplicatemovddupmovddup+)https://www.felixcloutier.com/x86/movddup VEXTRACTI32X4 VEXTRACTI32X4 vextracti32x44Extract 128 Bits of Packed Doubleword Integer Values vextracti32x4H vextracti32x40H vextracti32x4H vextracti32x40H vextracti32x4H vextracti32x4H vextracti32x4/H vextracti32x4/HVPSHLQVPSHLQvpshlqPacked Shift Logical Quadwordsvpshlq"vpshlq/"vpshlq/"MOVLPSMOVLPSmovlps6Move Low Packed Single-Precision Floating-Point ValuesmovlpsMOVLPS+movlpsMOVLPS+(https://www.felixcloutier.com/x86/movlps VFMSUB213SD VFMSUB213SD vfmsub213sdHFused Multiply-Subtract of Scalar Double-Precision Floating-Point Values vfmsub213sdH vfmsub213sd+H vfmsub213sd# vfmsub213sdH vfmsub213sd+# vfmsub213sd+H vfmsub213sdQH vfmsub213sdQHEhttps://www.felixcloutier.com/x86/vfmsub132sd:vfmsub213sd:vfmsub231sdVPADDWVPADDWvpaddwAdd Packed Word IntegersvpaddwIvpaddw/IvpaddwIvpaddw2IvpaddwIvpaddw5Ivpaddw vpaddwIvpaddw/ vpaddw/Ivpaddw!vpaddwIvpaddw2!vpaddw2IvpaddwIvpaddw5I VFMSUBADDPD VFMSUBADDPD vfmsubaddpdXFused Multiply-Alternating Subtract/Add of Packed Double-Precision Floating-Point Values vfmsubaddpd$ vfmsubaddpd/$ vfmsubaddpd/$ vfmsubaddpd$ vfmsubaddpd2$ vfmsubaddpd2$VPCMPEQQVPCMPEQQvpcmpeqq)Compare Packed Quadword Data for Equalityvpcmpeqq=Hvpcmpeqq=HvpcmpeqqHvpcmpeqqHvpcmpeqq?Hvpcmpeqq?HvpcmpeqqHvpcmpeqqHvpcmpeqqAHvpcmpeqqAHvpcmpeqqHvpcmpeqqHvpcmpeqq vpcmpeqq/ vpcmpeqq!vpcmpeqq2!PSUBSBPSUBSBpsubsb;Subtract Packed Signed Byte Integers with Signed SaturationpsubsbPSUBSB psubsbPSUBSB+ psubsbPSUBSBpsubsbPSUBSB//https://www.felixcloutier.com/x86/psubsb:psubswVPMAXSWVPMAXSWvpmaxsw&Maximum of Packed Signed Word IntegersvpmaxswIvpmaxsw/IvpmaxswIvpmaxsw2IvpmaxswIvpmaxsw5Ivpmaxsw vpmaxswIvpmaxsw/ vpmaxsw/Ivpmaxsw!vpmaxswIvpmaxsw2!vpmaxsw2IvpmaxswIvpmaxsw5IMOVHLPSMOVHLPSmovhlps>Move Packed Single-Precision Floating-Point Values High to LowmovhlpsMOVHLPS)https://www.felixcloutier.com/x86/movhlps VMOVDQU32 VMOVDQU32 vmovdqu32 Move Unaligned Doubleword Values vmovdqu320H vmovdqu32H vmovdqu323H vmovdqu32H vmovdqu326H vmovdqu32H vmovdqu32/H vmovdqu322H vmovdqu325H vmovdqu32H vmovdqu32/H vmovdqu32H vmovdqu322H vmovdqu32H vmovdqu325H vmovdqu32/H vmovdqu322H vmovdqu325HOhttps://www.felixcloutier.com/x86/movdqu:vmovdqu8:vmovdqu16:vmovdqu32:vmovdqu64PMOVSXWDPMOVSXWDpmovsxwdDMove Packed Word Integers to Doubleword Integers with Sign Extensionpmovsxwdpmovsxwd+BLSICBLSICblsic%Isolate Lowest Set Bit and Complementblsic6blsic'6ORORorLogical Inclusive ORorbORBorbORB orbORB  orbORB #orwORW orwORW orwORW orwORW  orwORW $orlORLorlORLorlORLorlORLorlORL'orbORB#orbORB# orwORW$orwORW$orwORW$ orlORL'orlORL'orlORL'$https://www.felixcloutier.com/x86/orSETBESETBEsetbe/Set byte if below or equal (CF == 1 or ZF == 1)setbeSETLS setbeSETLS#PMADDWDPMADDWDpmaddwd,Multiply and Add Packed Signed Word Integerspmaddwd pmaddwd+ pmaddwdpmaddwd/)https://www.felixcloutier.com/x86/pmaddwdSHUFPDSHUFPDshufpd5Shuffle Packed Double-Precision Floating-Point Valuesshufpdshufpd/(https://www.felixcloutier.com/x86/shufpd VPCLMULQDQ VPCLMULQDQ vpclmulqdq"Carry-Less Quadword Multiplication  vpclmulqdq  vpclmulqdqK vpclmulqdq/  vpclmulqdq/K vpclmulqdq vpclmulqdqK vpclmulqdq2 vpclmulqdq2K vpclmulqdqH vpclmulqdq5HVPCOMUWVPCOMUWvpcomuw%Compare Packed Unsigned Word Integersvpcomuw"vpcomuw/" VPDPWSSDS VPDPWSSDS vpdpwssdsVPacked Dot Product of Signed-by-Signed Word subvectors into Doubleword with Saturation vpdpwssds9K vpdpwssdsK vpdpwssds:K vpdpwssdsK vpdpwssds;V vpdpwssdsV vpdpwssds9K vpdpwssdsW vpdpwssdsK vpdpwssds/W vpdpwssds:K vpdpwssdsW vpdpwssdsK vpdpwssds2W vpdpwssds;V vpdpwssdsV+https://www.felixcloutier.com/x86/vpdpwssdsPSIGNWPSIGNWpsignwPacked Sign of Word Integerspsignwpsignw+psignwpsignw/6https://www.felixcloutier.com/x86/psignb:psignw:psignd VFMSUB231SH VFMSUB231SH vfmsub231shFFused Multiply-Subtract of Scalar Half-Precision Floating-Point Values vfmsub231shR vfmsub231sh$R vfmsub231shR vfmsub231sh$R vfmsub231shQR vfmsub231shQRlhttps://www.felixcloutier.com/x86/vfmsub132sh:vfnmsub132sh:vfmsub213sh:vfnmsub213sh:vfmsub231sh:vfnmsub231shVPTESTMWVPTESTMWvptestmw6Logical AND of Packed Word Integer Values and Set Mask vptestmwIvptestmwIvptestmw/Ivptestmw/IvptestmwIvptestmwIvptestmw2Ivptestmw2IvptestmwIvptestmwIvptestmw5Ivptestmw5IEhttps://www.felixcloutier.com/x86/vptestmb:vptestmw:vptestmd:vptestmq VRSQRT28PS VRSQRT28PS vrsqrt28ps€Approximation to the Reciprocal Square Root of Packed Single-Precision Floating-Point Values with Less Than 2^-28 Relative Error vrsqrt28ps;M vrsqrt28psM vrsqrt28ps;M vrsqrt28psM vrsqrt28psRM vrsqrt28psRM,https://www.felixcloutier.com/x86/vrsqrt28psPCMPEQWPCMPEQWpcmpeqw%Compare Packed Word Data for Equalitypcmpeqw pcmpeqw+ pcmpeqwpcmpeqw/9https://www.felixcloutier.com/x86/pcmpeqb:pcmpeqw:pcmpeqd VGATHERPF1DPS VGATHERPF1DPS vgatherpf1dpsoSparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Doubleword Indices Using T1 Hint vgatherpf1dpsKLYhttps://www.felixcloutier.com/x86/vgatherpf1dps:vgatherpf1qps:vgatherpf1dpd:vgatherpf1qpdBLCIBLCIblciIsolate Lowest Clear Bitblci6blci'6VPCMPWVPCMPWvpcmpw!Compare Packed Signed Word Values vpcmpwIvpcmpwIvpcmpw/Ivpcmpw/IvpcmpwIvpcmpwIvpcmpw2Ivpcmpw2IvpcmpwIvpcmpwIvpcmpw5Ivpcmpw5I0https://www.felixcloutier.com/x86/vpcmpw:vpcmpuw VPGATHERQQ VPGATHERQQ vpgatherqq;Gather Packed Quadword Values Using Signed Quadword Indices vpgatherqqDH vpgatherqqHH vpgatherqqLH vpgatherqqD! vpgatherqqH!7https://www.felixcloutier.com/x86/vpgatherqd:vpgatherqqPFMAXPFMAXpfmaxPacked Floating-Point Maximumpfmaxpfmax+ VBROADCASTSS VBROADCASTSS vbroadcastss1Broadcast Single-Precision Floating-Point Element  vbroadcastssH vbroadcastssH vbroadcastss'H vbroadcastss'H vbroadcastss! vbroadcastss'  vbroadcastss! vbroadcastssH vbroadcastss'  vbroadcastss'H vbroadcastssH vbroadcastss'HVCOMISSVCOMISSvcomissLCompare Scalar Ordered Single-Precision Floating-Point Values and Set EFLAGSvcomiss vcomissHvcomiss' vcomiss'HvcomissRH VCVTSI2SH VCVTSI2SH vcvtsi2sh7Convert Dword Integer to Scalar Half-Precision FP Value vcvtsi2shlR vcvtsi2shl'R vcvtsi2shlQR+https://www.felixcloutier.com/x86/vcvtsi2shMAXPDMAXPDmaxpd<Return Maximum Packed Double-Precision Floating-Point ValuesmaxpdMAXPDmaxpdMAXPD/'https://www.felixcloutier.com/x86/maxpd VGETEXPPD VGETEXPPD vgetexppdlExtract Exponents of Packed Double-Precision Floating-Point Values as Double-Precision Floating-Point Values vgetexppd=H vgetexppd?H vgetexppdAH vgetexppdH vgetexppdH vgetexppdH vgetexppd=H vgetexppdH vgetexppd?H vgetexppdH vgetexppdAH vgetexppdH vgetexppdRH vgetexppdRH+https://www.felixcloutier.com/x86/vgetexppdKXORWKXORWkxorw Bitwise Logical XOR 16-bit MaskskxorwH9https://www.felixcloutier.com/x86/kxorw:kxorb:kxorq:kxordVMULSDVMULSDvmulsd6Multiply Scalar Double-Precision Floating-Point ValuesvmulsdHvmulsd+Hvmulsd vmulsdHvmulsd+ vmulsd+HvmulsdQHvmulsdQH VFIXUPIMMPS VFIXUPIMMPS vfixupimmps<Fix Up Special Packed Single-Precision Floating-Point Values vfixupimmps9K vfixupimmpsK vfixupimmps:H vfixupimmpsH vfixupimmps;H vfixupimmpsH vfixupimmps9K vfixupimmpsK vfixupimmps:H vfixupimmpsH vfixupimmps;H vfixupimmpsH vfixupimmpsRH vfixupimmpsRH-https://www.felixcloutier.com/x86/vfixupimmps VFNMSUB231PH VFNMSUB231PH vfnmsub231phOFused Negative Multiply-Subtract of Packed Half-Precision Floating-Point Values vfnmsub231ph<K vfnmsub231phK vfnmsub231ph>K vfnmsub231phK vfnmsub231ph@R vfnmsub231phR vfnmsub231ph<K vfnmsub231phK vfnmsub231ph>K vfnmsub231phK vfnmsub231ph@R vfnmsub231phR vfnmsub231phQR vfnmsub231phQRlhttps://www.felixcloutier.com/x86/vfmsub132ph:vfnmsub132ph:vfmsub213ph:vfnmsub213ph:vfmsub231ph:vfnmsub231phVPCMPGTQVPCMPGTQvpcmpgtq$Compare Packed Data for Greater Thanvpcmpgtq=Hvpcmpgtq=HvpcmpgtqHvpcmpgtqHvpcmpgtq?Hvpcmpgtq?HvpcmpgtqHvpcmpgtqHvpcmpgtqAHvpcmpgtqAHvpcmpgtqHvpcmpgtqHvpcmpgtq vpcmpgtq/ vpcmpgtq!vpcmpgtq2!BTSBTSbtsBit Test and SetbtswBTSW btswBTSW  btslBTSLbtslBTSLbtswBTSW$btswBTSW$ btslBTSL'btslBTSL'%https://www.felixcloutier.com/x86/btsSETOSETOsetoSet byte if overflow (OF == 1)setoSETOS setoSETOS#KSHIFTRDKSHIFTRDkshiftrdShift Right 32-bit MaskskshiftrdIEhttps://www.felixcloutier.com/x86/kshiftrw:kshiftrb:kshiftrq:kshiftrd VCVTQQ2PH VCVTQQ2PH vcvtqq2phOConvert Packed Quadword Integers to Packed Half-Precision Floating-Point Values vcvtqq2phx=K vcvtqq2phy?K vcvtqq2phzAR vcvtqq2phxK vcvtqq2phyK vcvtqq2phzR vcvtqq2phx=K vcvtqq2phy?K vcvtqq2phzAR vcvtqq2phxK vcvtqq2phyK vcvtqq2phzR vcvtqq2phzQR vcvtqq2phzQR+https://www.felixcloutier.com/x86/vcvtqq2phVPRORVDVPRORVDvprorvd'Variable Rotate Packed Doubleword Right vprorvd9HvprorvdHvprorvd:HvprorvdHvprorvd;HvprorvdHvprorvd9HvprorvdHvprorvd:HvprorvdHvprorvd;HvprorvdH?https://www.felixcloutier.com/x86/vprord:vprorvd:vprorq:vprorvqPCMPGTBPCMPGTBpcmpgtb4Compare Packed Signed Byte Integers for Greater Thanpcmpgtb pcmpgtb+ pcmpgtbpcmpgtb/9https://www.felixcloutier.com/x86/pcmpgtb:pcmpgtw:pcmpgtdVSQRTPDVSQRTPDvsqrtpdECompute Square Roots of Packed Double-Precision Floating-Point Valuesvsqrtpd9Hvsqrtpd:HvsqrtpdAHvsqrtpdHvsqrtpdHvsqrtpdHvsqrtpd9Hvsqrtpd vsqrtpdHvsqrtpd/ vsqrtpd:Hvsqrtpd vsqrtpdHvsqrtpd2 vsqrtpdAHvsqrtpdHvsqrtpdQHvsqrtpdQHVPSLLVWVPSLLVWvpsllvw,Variable Shift Packed Word Data Left Logical vpsllvwIvpsllvw/IvpsllvwIvpsllvw2IvpsllvwIvpsllvw5IvpsllvwIvpsllvw/IvpsllvwIvpsllvw2IvpsllvwIvpsllvw5I9https://www.felixcloutier.com/x86/vpsllvw:vpsllvd:vpsllvq VADDSUBPD VADDSUBPD vaddsubpdPacked Double-FP Add/Subtract vaddsubpd  vaddsubpd/  vaddsubpd  vaddsubpd2  VCVTUDQ2PD VCVTUDQ2PD vcvtudq2pd\Convert Packed Unsigned Doubleword Integers to Packed Double-Precision Floating-Point Values  vcvtudq2pd8H vcvtudq2pd9H vcvtudq2pd:H vcvtudq2pdH vcvtudq2pdH vcvtudq2pdH vcvtudq2pd8H vcvtudq2pdH vcvtudq2pd9H vcvtudq2pdH vcvtudq2pd:H vcvtudq2pdH,https://www.felixcloutier.com/x86/vcvtudq2pdPSLLDPSLLDpslld)Shift Packed Doubleword Data Left Logicalpslld pslld pslld+ pslldpslldpslld/3https://www.felixcloutier.com/x86/psllw:pslld:psllqVBROADCASTI128VBROADCASTI128vbroadcasti128"Broadcast 128 Bits of Integer Datavbroadcasti128/!VFMADDSUB132PSVFMADDSUB132PSvfmaddsub132psXFused Multiply-Alternating Add/Subtract of Packed Single-Precision Floating-Point Valuesvfmaddsub132ps9Hvfmaddsub132psHvfmaddsub132ps:Hvfmaddsub132psHvfmaddsub132ps;Hvfmaddsub132psHvfmaddsub132ps9Hvfmaddsub132ps#vfmaddsub132psHvfmaddsub132ps/#vfmaddsub132ps:Hvfmaddsub132ps#vfmaddsub132psHvfmaddsub132ps2#vfmaddsub132ps;Hvfmaddsub132psHvfmaddsub132psQHvfmaddsub132psQHNhttps://www.felixcloutier.com/x86/vfmaddsub132ps:vfmaddsub213ps:vfmaddsub231psVROUNDPSVROUNDPSvroundps3Round Packed Single Precision Floating-Point Valuesvroundps vroundps/ vroundps vroundps2  VGATHERQPS VGATHERQPS vgatherqpsRGather Packed Single-Precision Floating-Point Values Using Signed Quadword Indices vgatherqpsDH vgatherqpsHH vgatherqpsLH vgatherqpsD! vgatherqpsH!7https://www.felixcloutier.com/x86/vgatherqps:vgatherqpdSQRTSSSQRTSSsqrtssCCompute Square Root of Scalar Single-Precision Floating-Point ValuesqrtssSQRTSSsqrtssSQRTSS'(https://www.felixcloutier.com/x86/sqrtssVPSRLVQVPSRLVQvpsrlvq1Variable Shift Packed Quadword Data Right Logicalvpsrlvq=HvpsrlvqHvpsrlvq?HvpsrlvqHvpsrlvqAHvpsrlvqHvpsrlvq=Hvpsrlvq!vpsrlvqHvpsrlvq/!vpsrlvq?Hvpsrlvq!vpsrlvqHvpsrlvq2!vpsrlvqAHvpsrlvqH9https://www.felixcloutier.com/x86/vpsrlvw:vpsrlvd:vpsrlvqVRCPPSVRCPPSvrcppsPCompute Approximate Reciprocals of Packed Single-Precision Floating-Point Valuesvrcpps vrcpps/ vrcpps vrcpps2 VPOPCNTDVPOPCNTDvpopcntd/Packed Population Count for Doubleword Integers vpopcntd9Kvpopcntd:Kvpopcntd;PvpopcntdKvpopcntdKvpopcntdPvpopcntd9KvpopcntdKvpopcntd:KvpopcntdKvpopcntd;PvpopcntdP VMOVDQA64 VMOVDQA64 vmovdqa64Move Aligned Quadword Values vmovdqa640H vmovdqa64H vmovdqa643H vmovdqa64H vmovdqa646H vmovdqa64H vmovdqa64/H vmovdqa642H vmovdqa645H vmovdqa64H vmovdqa64/H vmovdqa64H vmovdqa642H vmovdqa64H vmovdqa645H vmovdqa64/H vmovdqa642H vmovdqa645H<https://www.felixcloutier.com/x86/movdqa:vmovdqa32:vmovdqa64VMOVSDVMOVSDvmovsd1Move Scalar Double-Precision Floating-Point Value vmovsd,Hvmovsd+Hvmovsd+ vmovsd+Hvmovsd+ vmovsd+HvmovsdHvmovsd vmovsdH VINSERTF32X8 VINSERTF32X8 vinsertf32x8@Insert 256 Bits of Packed Single-Precision Floating-Point Values vinsertf32x8J vinsertf32x82J vinsertf32x8J vinsertf32x82J VFNMSUB213SD VFNMSUB213SD vfnmsub213sdQFused Negative Multiply-Subtract of Scalar Double-Precision Floating-Point Values vfnmsub213sdH vfnmsub213sd+H vfnmsub213sd# vfnmsub213sdH vfnmsub213sd+# vfnmsub213sd+H vfnmsub213sdQH vfnmsub213sdQHHhttps://www.felixcloutier.com/x86/vfnmsub132sd:vfnmsub213sd:vfnmsub231sd VINSERTF32X4 VINSERTF32X4 vinsertf32x4@Insert 128 Bits of Packed Single-Precision Floating-Point Values vinsertf32x4H vinsertf32x4/H vinsertf32x4H vinsertf32x4/H vinsertf32x4H vinsertf32x4/H vinsertf32x4H vinsertf32x4/HVFMSUBADD213PHVFMSUBADD213PHvfmsubadd213phVFused Multiply-Alternating Subtract/Add of Packed Half-Precision Floating-Point Valuesvfmsubadd213ph<Kvfmsubadd213phKvfmsubadd213ph>Kvfmsubadd213phKvfmsubadd213ph@Rvfmsubadd213phRvfmsubadd213ph<Kvfmsubadd213phKvfmsubadd213ph>Kvfmsubadd213phKvfmsubadd213ph@Rvfmsubadd213phRvfmsubadd213phQRvfmsubadd213phQRNhttps://www.felixcloutier.com/x86/vfmsubadd132ph:vfmsubadd213ph:vfmsubadd231ph VCVTPS2PD VCVTPS2PD vcvtps2pdNConvert Packed Single-Precision FP Values to Packed Double-Precision FP Values vcvtps2pd8H vcvtps2pd9K vcvtps2pd:H vcvtps2pdH vcvtps2pdK vcvtps2pdH vcvtps2pd8H vcvtps2pd  vcvtps2pdH vcvtps2pd+  vcvtps2pd9K vcvtps2pd  vcvtps2pdK vcvtps2pd/  vcvtps2pd:H vcvtps2pdH vcvtps2pdRH vcvtps2pdRH MOVDIR64B MOVDIR64B movdir64bMOVe to DIRect store 64 Bytes movdir64b51+https://www.felixcloutier.com/x86/movdir64b VPACKUSDW VPACKUSDW vpackusdw4Pack Doublewords into Words with Unsigned Saturation vpackusdw9I vpackusdwI vpackusdw:I vpackusdwI vpackusdw;I vpackusdwI vpackusdw9I vpackusdw  vpackusdwI vpackusdw/  vpackusdw:I vpackusdw! vpackusdwI vpackusdw2! vpackusdw;I vpackusdwIVPCMPUQVPCMPUQvpcmpuq'Compare Packed Unsigned Quadword Values vpcmpuq=Hvpcmpuq=HvpcmpuqHvpcmpuqHvpcmpuq?Hvpcmpuq?HvpcmpuqHvpcmpuqHvpcmpuqAHvpcmpuqAHvpcmpuqHvpcmpuqH0https://www.felixcloutier.com/x86/vpcmpq:vpcmpuqROUNDPDROUNDPDroundpd3Round Packed Double Precision Floating-Point Valuesroundpdroundpd/)https://www.felixcloutier.com/x86/roundpd VFNMADD132SD VFNMADD132SD vfnmadd132sdLFused Negative Multiply-Add of Scalar Double-Precision Floating-Point Values vfnmadd132sdH vfnmadd132sd+H vfnmadd132sd# vfnmadd132sdH vfnmadd132sd+# vfnmadd132sd+H vfnmadd132sdQH vfnmadd132sdQHHhttps://www.felixcloutier.com/x86/vfnmadd132sd:vfnmadd213sd:vfnmadd231sdPFRSQRTVPFRSQRTVpfrsqrtvAPacked Floating-Point Reciprocal Square Root Approximation Vectorpfrsqrtvpfrsqrtv+ VFNMADDSD VFNMADDSD vfnmaddsdLFused Negative Multiply-Add of Scalar Double-Precision Floating-Point Values vfnmaddsd$ vfnmaddsd+$ vfnmaddsd+$VPSRLWVPSRLWvpsrlw$Shift Packed Word Data Right LogicalvpsrlwIvpsrlwIvpsrlw/IvpsrlwIvpsrlwIvpsrlw/IvpsrlwIvpsrlwIvpsrlw/Ivpsrlw/Ivpsrlw2Ivpsrlw5Ivpsrlw vpsrlwIvpsrlw vpsrlwIvpsrlw/ vpsrlw/Ivpsrlw/Ivpsrlw!vpsrlwIvpsrlw!vpsrlwIvpsrlw/!vpsrlw/Ivpsrlw2IvpsrlwIvpsrlwIvpsrlw/Ivpsrlw5IBTCBTCbtcBit Test and ComplementbtcwBTCW btcwBTCW  btclBTCLbtclBTCLbtcwBTCW$btcwBTCW$ btclBTCL'btclBTCL'%https://www.felixcloutier.com/x86/btcVXORPSVXORPSvxorps>Bitwise Logical XOR for Single-Precision Floating-Point Valuesvxorps9JvxorpsJvxorps:JvxorpsJvxorps;JvxorpsJvxorps9Jvxorps vxorpsJvxorps/ vxorps:Jvxorps vxorpsJvxorps2 vxorps;JvxorpsJVPMOVDWVPMOVDWvpmovdwDDown Convert Packed Doubleword Values to Word Values with Truncation vpmovdwHvpmovdw,HvpmovdwHvpmovdw0HvpmovdwHvpmovdw3HvpmovdwHvpmovdwHvpmovdwHvpmovdw+Hvpmovdw/Hvpmovdw2H<https://www.felixcloutier.com/x86/vpmovdw:vpmovsdw:vpmovusdwVPHSUBDQVPHSUBDQvphsubdq?Packed Horizontal Subtract Signed Doubleword to Signed Quadwordvphsubdq"vphsubdq/"PMINUBPMINUBpminub(Minimum of Packed Unsigned Byte IntegerspminubPMINUB pminubPMINUB+ pminubPMINUBpminubPMINUB//https://www.felixcloutier.com/x86/pminub:pminuw VINSERTI128 VINSERTI128 vinserti128Insert Packed Integer Values vinserti128! vinserti128/!ahttps://www.felixcloutier.com/x86/vinserti128:vinserti32x4:vinserti64x2:vinserti32x8:vinserti64x4VFMSUBPDVFMSUBPDvfmsubpdHFused Multiply-Subtract of Packed Double-Precision Floating-Point Valuesvfmsubpd$vfmsubpd/$vfmsubpd/$vfmsubpd$vfmsubpd2$vfmsubpd2$PADDUSWPADDUSWpaddusw:Add Packed Unsigned Word Integers with Unsigned Saturationpaddusw paddusw+ padduswpaddusw/1https://www.felixcloutier.com/x86/paddusb:padduswBLCICBLCICblcic%Isolate Lowest Set Bit and Complementblcic6blcic'6PSIGNDPSIGNDpsignd"Packed Sign of Doubleword Integerspsigndpsignd+psigndpsignd/6https://www.felixcloutier.com/x86/psignb:psignw:psignd PUNPCKLBW PUNPCKLBW punpcklbw0Unpack and Interleave Low-Order Bytes into Words punpcklbw  punpcklbw'  punpcklbw punpcklbw/Jhttps://www.felixcloutier.com/x86/punpcklbw:punpcklwd:punpckldq:punpcklqdqVGF2P8AFFINEQBVGF2P8AFFINEQBvgf2p8affineqb(Galois Field (2^8) Affine Transformationvgf2p8affineqb=vgf2p8affineqbvgf2p8affineqb?vgf2p8affineqbvgf2p8affineqbAvgf2p8affineqbvgf2p8affineqb=vgf2p8affineqbvgf2p8affineqbvgf2p8affineqb/vgf2p8affineqb?vgf2p8affineqbvgf2p8affineqbvgf2p8affineqb2vgf2p8affineqbAvgf2p8affineqbVPSHLWVPSHLWvpshlwPacked Shift Logical Wordsvpshlw"vpshlw/"vpshlw/" VFCMULCPH VFCMULCPH vfcmulcphOFused Conjugate Multiply of Complex Packed Half-Precision Floating-Point Values vfcmulcph9K vfcmulcphK vfcmulcph:K vfcmulcphK vfcmulcph;R vfcmulcphR vfcmulcph9K vfcmulcphK vfcmulcph:K vfcmulcphK vfcmulcph;R vfcmulcphR vfcmulcphQR vfcmulcphQR4https://www.felixcloutier.com/x86/vfcmulcph:vfmulcphVMOVNTPSVMOVNTPSvmovntpsKStore Packed Single-Precision Floating-Point Values Using Non-Temporal Hintvmovntps/ vmovntps/Hvmovntps2 vmovntps2Hvmovntps5H VSCALEFPH VSCALEFPH vscalefph[Scale Packed Half-Precision Floating-Point Values With Half-Precision Floating-Point Values vscalefph<K vscalefphK vscalefph>K vscalefphK vscalefph@R vscalefphR vscalefph<K vscalefphK vscalefph>K vscalefphK vscalefph@R vscalefphR vscalefphQR vscalefphQR+https://www.felixcloutier.com/x86/vscalefphPF2IDPF2IDpf2id5Packed Floating-Point to Integer Doubleword Conversonpf2idpf2id+PMOVMSKBPMOVMSKBpmovmskbMove Byte MaskpmovmskbPMOVMSKB pmovmskbPMOVMSKB*https://www.felixcloutier.com/x86/pmovmskb VPBROADCASTD VPBROADCASTD vpbroadcastdBroadcast Doubleword Integer vpbroadcastdH vpbroadcastdH vpbroadcastdH vpbroadcastdH vpbroadcastdH vpbroadcastdH vpbroadcastd'H vpbroadcastd'H vpbroadcastd'H vpbroadcastdH vpbroadcastd! vpbroadcastdH vpbroadcastd'! vpbroadcastd'H vpbroadcastdH vpbroadcastd! vpbroadcastdH vpbroadcastd'! vpbroadcastd'H vpbroadcastdH vpbroadcastdH vpbroadcastd'HUhttps://www.felixcloutier.com/x86/vpbroadcastb:vpbroadcastw:vpbroadcastd:vpbroadcastq VPHMINPOSUW VPHMINPOSUW vphminposuw3Packed Horizontal Minimum of Unsigned Word Integers vphminposuw  vphminposuw/  VPUNPCKHQDQ VPUNPCKHQDQ vpunpckhqdq@Unpack and Interleave High-Order Quadwords into Double Quadwords vpunpckhqdq=H vpunpckhqdqH vpunpckhqdq?H vpunpckhqdqH vpunpckhqdqAH vpunpckhqdqH vpunpckhqdq=H vpunpckhqdq  vpunpckhqdqH vpunpckhqdq/  vpunpckhqdq?H vpunpckhqdq! vpunpckhqdqH vpunpckhqdq2! vpunpckhqdqAH vpunpckhqdqHPMULUDQPMULUDQpmuludq,Multiply Packed Unsigned Doubleword Integerspmuludqpmuludq+pmuludqpmuludq/)https://www.felixcloutier.com/x86/pmuludqBLENDPSBLENDPSblendps4 Blend Packed Single Precision Floating-Point Valuesblendpsblendps/)https://www.felixcloutier.com/x86/blendpsVPMOVDBVPMOVDBvpmovdbDDown Convert Packed Doubleword Values to Byte Values with Truncation vpmovdbHvpmovdb(HvpmovdbHvpmovdb,HvpmovdbHvpmovdb0HvpmovdbHvpmovdbHvpmovdbHvpmovdb'Hvpmovdb+Hvpmovdb/H<https://www.felixcloutier.com/x86/vpmovdb:vpmovsdb:vpmovusdbVPMINUBVPMINUBvpminub(Minimum of Packed Unsigned Byte IntegersvpminubIvpminub/IvpminubIvpminub2IvpminubIvpminub5Ivpminub vpminubIvpminub/ vpminub/Ivpminub!vpminubIvpminub2!vpminub2IvpminubIvpminub5IKXORQKXORQkxorq Bitwise Logical XOR 64-bit MaskskxorqI9https://www.felixcloutier.com/x86/kxorw:kxorb:kxorq:kxordVPSUBSBVPSUBSBvpsubsb;Subtract Packed Signed Byte Integers with Signed SaturationvpsubsbIvpsubsb/IvpsubsbIvpsubsb2IvpsubsbIvpsubsb5Ivpsubsb vpsubsbIvpsubsb/ vpsubsb/Ivpsubsb!vpsubsbIvpsubsb2!vpsubsb2IvpsubsbIvpsubsb5ICMOVOCMOVOcmovoMove if overflow (OF == 1)cmovow  cmovow $cmovolcmovol'MINPDMINPDminpd<Return Minimum Packed Double-Precision Floating-Point ValuesminpdMINPDminpdMINPD/'https://www.felixcloutier.com/x86/minpdSQRTSDSQRTSDsqrtsdCCompute Square Root of Scalar Double-Precision Floating-Point ValuesqrtsdSQRTSDsqrtsdSQRTSD+(https://www.felixcloutier.com/x86/sqrtsd VPMACSSDD VPMACSSDD vpmacssddQPacked Multiply Accumulate with Saturation Signed Doubleword to Signed Doubleword vpmacssdd" vpmacssdd/" VPHADDUBW VPHADDUBW vphaddubw+Packed Horizontal Add Unsigned Byte to Word vphaddubw" vphaddubw/"KNOTQKNOTQknotqNOT 64-bit Mask RegisterknotqI9https://www.felixcloutier.com/x86/knotw:knotb:knotq:knotdPSHUFHWPSHUFHWpshufhwShuffle Packed High WordspshufhwPSHUFHWpshufhwPSHUFHW/)https://www.felixcloutier.com/x86/pshufhwKTESTDKTESTDktestd#Bit Test 32-bit Masks and Set FlagsktestdI=https://www.felixcloutier.com/x86/ktestw:ktestb:ktestq:ktestdVADDPDVADDPDvaddpd1Add Packed Double-Precision Floating-Point Valuesvaddpd=HvaddpdHvaddpd?HvaddpdHvaddpdAHvaddpdHvaddpd=Hvaddpd vaddpdHvaddpd/ vaddpd?Hvaddpd vaddpdHvaddpd2 vaddpdAHvaddpdHvaddpdQHvaddpdQH VCVTQQ2PS VCVTQQ2PS vcvtqq2psQConvert Packed Quadword Integers to Packed Single-Precision Floating-Point Values vcvtqq2psx=J vcvtqq2psy?J vcvtqq2psAJ vcvtqq2psxJ vcvtqq2psyJ vcvtqq2psJ vcvtqq2psx=J vcvtqq2psy?J vcvtqq2psxJ vcvtqq2psyJ vcvtqq2psAJ vcvtqq2psJ vcvtqq2psQJ vcvtqq2psQJ+https://www.felixcloutier.com/x86/vcvtqq2psVSM3MSG2VSM3MSG2vsm3msg2=Perform Final Calculation for the Next Four SM3 Message Wordsvsm3msg2vsm3msg2/PCMPEQQPCMPEQQpcmpeqq)Compare Packed Quadword Data for Equalitypcmpeqqpcmpeqq/)https://www.felixcloutier.com/x86/pcmpeqqADDSUBPSADDSUBPSaddsubpsPacked Single-FP Add/Subtractaddsubpsaddsubps/*https://www.felixcloutier.com/x86/addsubpsVPROTDVPROTDvprotdPacked Rotate Doublewordsvprotd"vprotd"vprotd/"vprotd/"vprotd/"VTESTPSVTESTPSvtestps/Packed Single-Precision Floating-Point Bit Testvtestps vtestps/ vtestps vtestps2 1https://www.felixcloutier.com/x86/vtestpd:vtestpsXORPSXORPSxorps>Bitwise Logical XOR for Single-Precision Floating-Point ValuesxorpsXORPSxorpsXORPS/'https://www.felixcloutier.com/x86/xorpsPSUBUSWPSUBUSWpsubusw?Subtract Packed Unsigned Word Integers with Unsigned SaturationpsubuswPSUBUSW psubuswPSUBUSW+ psubuswPSUBUSWpsubuswPSUBUSW/1https://www.felixcloutier.com/x86/psubusb:psubuswADDPDADDPDaddpd1Add Packed Double-Precision Floating-Point ValuesaddpdADDPDaddpdADDPD/'https://www.felixcloutier.com/x86/addpd AESDECLAST AESDECLAST aesdeclast,Perform Last Round of an AES Decryption Flow aesdeclast' aesdeclast/',https://www.felixcloutier.com/x86/aesdeclastCVTSD2SICVTSD2SIcvtsd2si3Convert Scalar Double-Precision FP Value to Integercvtsd2siCVTSD2SLcvtsd2siCVTSD2SL+*https://www.felixcloutier.com/x86/cvtsd2siJCJCjcJump if carry (CF == 1)jcJCSNjcJCSO PCMPESTRI PCMPESTRI pcmpestri4Packed Compare Explicit Length Strings, Return Index pcmpestril pcmpestril/+https://www.felixcloutier.com/x86/pcmpestriVFRCZPSVFRCZPSvfrczps7Extract Fraction Packed Single-Precision Floating-Pointvfrczps"vfrczps/"vfrczps"vfrczps2"BSWAPBSWAPbswap Byte Swapbswap'https://www.felixcloutier.com/x86/bswap VCVTSS2SD VCVTSS2SD vcvtss2sdLConvert Scalar Single-Precision FP Value to Scalar Double-Precision FP Value vcvtss2sdH vcvtss2sd'H vcvtss2sd  vcvtss2sdH vcvtss2sd'  vcvtss2sd'H vcvtss2sdRH vcvtss2sdRHPMOVZXBDPMOVZXBDpmovzxbdDMove Packed Byte Integers to Doubleword Integers with Zero Extensionpmovzxbdpmovzxbd'ADDSDADDSDaddsd1Add Scalar Double-Precision Floating-Point ValuesaddsdADDSDaddsdADDSD+'https://www.felixcloutier.com/x86/addsdCMPXCHGCMPXCHGcmpxchgCompare and ExchangecmpxchgbCMPXCHGB  cmpxchgwCMPXCHGW  cmpxchglCMPXCHGLcmpxchgbCMPXCHGB# cmpxchgwCMPXCHGW$ cmpxchglCMPXCHGL')https://www.felixcloutier.com/x86/cmpxchg VFPCLASSSS VFPCLASSSS vfpclassss:Test Class of Scalar Single-Precision Floating-Point Value vfpclassssJ vfpclassssJ vfpclassss'J vfpclassss'J,https://www.felixcloutier.com/x86/vfpclassss VFNMADD213PD VFNMADD213PD vfnmadd213pdLFused Negative Multiply-Add of Packed Double-Precision Floating-Point Values vfnmadd213pd=H vfnmadd213pdH vfnmadd213pd?H vfnmadd213pdH vfnmadd213pdAH vfnmadd213pdH vfnmadd213pd=H vfnmadd213pd# vfnmadd213pdH vfnmadd213pd/# vfnmadd213pd?H vfnmadd213pd# vfnmadd213pdH vfnmadd213pd2# vfnmadd213pdAH vfnmadd213pdH vfnmadd213pdQH vfnmadd213pdQHHhttps://www.felixcloutier.com/x86/vfnmadd132pd:vfnmadd213pd:vfnmadd231pd VPTERNLOGQ VPTERNLOGQ vpternlogq4Bitwise Ternary Logical Operation on Quadword Values  vpternlogq=H vpternlogqH vpternlogq?H vpternlogqH vpternlogqAH vpternlogqH vpternlogq=H vpternlogqH vpternlogq?H vpternlogqH vpternlogqAH vpternlogqH7https://www.felixcloutier.com/x86/vpternlogd:vpternlogqPACKSSWBPACKSSWBpacksswb,Pack Words into Bytes with Signed Saturationpacksswb packsswb+ packsswbpacksswb/3https://www.felixcloutier.com/x86/packsswb:packssdwSETNCSETNCsetncSet byte if not carry (CF == 0)setncSETCC setncSETCC#CMOVNPCMOVNPcmovnpMove if not parity (PF == 0)cmovnpw  cmovnpw $cmovnplcmovnpl'SETNOSETNOsetno"Set byte if not overflow (OF == 0)setnoSETOC setnoSETOC# VGETMANTPS VGETMANTPS vgetmantpsOExtract Normalized Mantissas from Packed Single-Precision Floating-Point Values vgetmantps9H vgetmantps:H vgetmantps;H vgetmantpsH vgetmantpsH vgetmantpsH vgetmantps9H vgetmantpsH vgetmantps:H vgetmantpsH vgetmantps;H vgetmantpsH vgetmantpsRH vgetmantpsRH,https://www.felixcloutier.com/x86/vgetmantpsVPTESTMQVPTESTMQvptestmq:Logical AND of Packed Quadword Integer Values and Set Mask vptestmq=Hvptestmq=HvptestmqHvptestmqHvptestmq?Hvptestmq?HvptestmqHvptestmqHvptestmqAHvptestmqAHvptestmqHvptestmqHEhttps://www.felixcloutier.com/x86/vptestmb:vptestmw:vptestmd:vptestmqDIVSDDIVSDdivsd4Divide Scalar Double-Precision Floating-Point ValuesdivsdDIVSDdivsdDIVSD+'https://www.felixcloutier.com/x86/divsdANDNPDANDNPDandnpdHBitwise Logical AND NOT of Packed Double-Precision Floating-Point ValuesandnpdANDNPDandnpdANDNPD/(https://www.felixcloutier.com/x86/andnpd VEXTRACTF32X4 VEXTRACTF32X4 vextractf32x4AExtract 128 Bits of Packed Single-Precision Floating-Point Values vextractf32x4H vextractf32x40H vextractf32x4H vextractf32x40H vextractf32x4H vextractf32x4H vextractf32x4/H vextractf32x4/H VPBLENDMW VPBLENDMW vpblendmw*Blend Word Vectors Using an OpMask Control  vpblendmwI vpblendmw/I vpblendmwI vpblendmw2I vpblendmwI vpblendmw5I vpblendmwI vpblendmw/I vpblendmwI vpblendmw2I vpblendmwI vpblendmw5I5https://www.felixcloutier.com/x86/vpblendmb:vpblendmwVPMAXUDVPMAXUDvpmaxud.Maximum of Packed Unsigned Doubleword Integersvpmaxud9HvpmaxudHvpmaxud:HvpmaxudHvpmaxud;HvpmaxudHvpmaxud9Hvpmaxud vpmaxudHvpmaxud/ vpmaxud:Hvpmaxud!vpmaxudHvpmaxud2!vpmaxud;HvpmaxudHDAADAAdaa Decimal Adjust AL after AdditiondaaDAA%https://www.felixcloutier.com/x86/daaKANDNDKANDNDkandnd$Bitwise Logical AND NOT 32-bit MaskskandndI=https://www.felixcloutier.com/x86/kandnw:kandnb:kandnq:kandndSHLSHLshlLogical Shift LeftshlbSHLB shlbSHLB shlbSHLB shlwSHLW shlwSHLW shlwSHLW shllSHLLshllSHLLshllSHLLshlbSHLB#shlbSHLB#shlbSHLB#shlwSHLW$shlwSHLW$shlwSHLW$shllSHLL'shllSHLL'shllSHLL'1https://www.felixcloutier.com/x86/sal:sar:shl:shr VGATHERPF0QPS VGATHERPF0QPS vgatherpf0qpsmSparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Quadword Indices Using T0 Hint vgatherpf0qpsMLYhttps://www.felixcloutier.com/x86/vgatherpf0dps:vgatherpf0qps:vgatherpf0dpd:vgatherpf0qpdVDIVPDVDIVPDvdivpd4Divide Packed Double-Precision Floating-Point Valuesvdivpd=HvdivpdHvdivpd?HvdivpdHvdivpdAHvdivpdHvdivpd=Hvdivpd vdivpdHvdivpd/ vdivpd?Hvdivpd vdivpdHvdivpd2 vdivpdAHvdivpdHvdivpdQHvdivpdQHANDNANDNandnLogical AND NOTandnl4andnl'4&https://www.felixcloutier.com/x86/andnVPCOMUQVPCOMUQvpcomuq)Compare Packed Unsigned Quadword Integersvpcomuq"vpcomuq/" VFNMSUB132PS VFNMSUB132PS vfnmsub132psQFused Negative Multiply-Subtract of Packed Single-Precision Floating-Point Values vfnmsub132ps9H vfnmsub132psH vfnmsub132ps:H vfnmsub132psH vfnmsub132ps;H vfnmsub132psH vfnmsub132ps9H vfnmsub132ps# vfnmsub132psH vfnmsub132ps/# vfnmsub132ps:H vfnmsub132ps# vfnmsub132psH vfnmsub132ps2# vfnmsub132ps;H vfnmsub132psH vfnmsub132psQH vfnmsub132psQHHhttps://www.felixcloutier.com/x86/vfnmsub132ps:vfnmsub213ps:vfnmsub231psVFRCZPDVFRCZPDvfrczpd7Extract Fraction Packed Double-Precision Floating-Pointvfrczpd"vfrczpd/"vfrczpd"vfrczpd2" VPERM2I128 VPERM2I128 vperm2i128Permute 128-Bit Integer Values vperm2i128! vperm2i1282!,https://www.felixcloutier.com/x86/vperm2i128PSADBWPSADBWpsadbw#Compute Sum of Absolute DifferencespsadbwPSADBW psadbwPSADBW+ psadbwPSADBWpsadbwPSADBW/(https://www.felixcloutier.com/x86/psadbwVMOVQVMOVQvmovq Move Quadwordvmovq vmovqHvmovq+ vmovq+Hvmovq+ vmovq+HCMCCMCcmcComplement Carry FlagcmcCMC%https://www.felixcloutier.com/x86/cmcPMAXUBPMAXUBpmaxub(Maximum of Packed Unsigned Byte IntegerspmaxubPMAXUB pmaxubPMAXUB+ pmaxubPMAXUBpmaxubPMAXUB//https://www.felixcloutier.com/x86/pmaxub:pmaxuw VCVTDQ2PH VCVTDQ2PH vcvtdq2ph@Convert Packed Dword Integers to Packed Half-Precision FP Values vcvtdq2phx9K vcvtdq2phy:K vcvtdq2ph;R vcvtdq2phxK vcvtdq2phyK vcvtdq2phR vcvtdq2phx9K vcvtdq2phy:K vcvtdq2phxK vcvtdq2phyK vcvtdq2ph;R vcvtdq2phR vcvtdq2phQR vcvtdq2phQR+https://www.felixcloutier.com/x86/vcvtdq2phINSERTPSINSERTPSinsertps3Insert Packed Single Precision Floating-Point Valueinsertpsinsertps'*https://www.felixcloutier.com/x86/insertpsJPEJPEjpeJump if parity even (PF == 1)jpeJPSNjpeJPSOVPSHABVPSHABvpshabPacked Shift Arithmetic Bytesvpshab"vpshab/"vpshab/" VCVTDQ2PD VCVTDQ2PD vcvtdq2pdBConvert Packed Dword Integers to Packed Double-Precision FP Values vcvtdq2pd8H vcvtdq2pd9H vcvtdq2pd:H vcvtdq2pdH vcvtdq2pdH vcvtdq2pdH vcvtdq2pd8H vcvtdq2pd  vcvtdq2pdH vcvtdq2pd+  vcvtdq2pd9H vcvtdq2pd  vcvtdq2pdH vcvtdq2pd/  vcvtdq2pd:H vcvtdq2pdHVPADDBVPADDBvpaddbAdd Packed Byte IntegersvpaddbIvpaddb/IvpaddbIvpaddb2IvpaddbIvpaddb5Ivpaddb vpaddbIvpaddb/ vpaddb/Ivpaddb!vpaddbIvpaddb2!vpaddb2IvpaddbIvpaddb5I VSHUFI32X4 VSHUFI32X4 vshufi32x40Shuffle 128-Bit Packed Doubleword Integer Values vshufi32x4:H vshufi32x4H vshufi32x4;H vshufi32x4H vshufi32x4:H vshufi32x4H vshufi32x4;H vshufi32x4H VCVTPS2QQ VCVTPS2QQ vcvtps2qq^Convert Packed Single Precision Floating-Point Values to Packed Singed Quadword Integer Values vcvtps2qq8J vcvtps2qq9J vcvtps2qq:J vcvtps2qqJ vcvtps2qqJ vcvtps2qqJ vcvtps2qq8J vcvtps2qqJ vcvtps2qq9J vcvtps2qqJ vcvtps2qq:J vcvtps2qqJ vcvtps2qqQJ vcvtps2qqQJ+https://www.felixcloutier.com/x86/vcvtps2qqKORTESTWKORTESTWkortestwOR 16-bit Masks and Set FlagskortestwHEhttps://www.felixcloutier.com/x86/kortestw:kortestb:kortestq:kortestdVPMINUQVPMINUQvpminuq,Minimum of Packed Unsigned Quadword Integers vpminuq=HvpminuqHvpminuq?HvpminuqHvpminuqAHvpminuqHvpminuq=HvpminuqHvpminuq?HvpminuqHvpminuqAHvpminuqHVPEXTRBVPEXTRBvpextrb Extract Bytevpextrb vpextrbIvpextrb# vpextrb#IVPDPBUSDVPDPBUSDvpdpbusdHPacked Dot Product of Unsigned-by-Singed Byte subvectors into Doublewordvpdpbusd9KvpdpbusdKvpdpbusd:KvpdpbusdKvpdpbusd;VvpdpbusdVvpdpbusd9KvpdpbusdWvpdpbusdKvpdpbusd/Wvpdpbusd:KvpdpbusdWvpdpbusdKvpdpbusd2Wvpdpbusd;VvpdpbusdV*https://www.felixcloutier.com/x86/vpdpbusdVSCATTERPF1DPDVSCATTERPF1DPDvscatterpf1dpd„Sparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Doubleword Indices Using T1 Hint with Intent to Writevscatterpf1dpdGL]https://www.felixcloutier.com/x86/vscatterpf1dps:vscatterpf1qps:vscatterpf1dpd:vscatterpf1qpdMULSSMULSSmulss6Multiply Scalar Single-Precision Floating-Point ValuesmulssMULSSmulssMULSS''https://www.felixcloutier.com/x86/mulssPADDQPADDQpaddqAdd Packed Quadword IntegerspaddqPADDQpaddqPADDQ+paddqPADDQpaddqPADDQ/9https://www.felixcloutier.com/x86/paddb:paddw:paddd:paddq PREFETCHW PREFETCHW prefetchw4Prefetch Data into Caches in Anticipation of a Write prefetchw#B+https://www.felixcloutier.com/x86/prefetchw VFNMADD213PS VFNMADD213PS vfnmadd213psLFused Negative Multiply-Add of Packed Single-Precision Floating-Point Values vfnmadd213ps9H vfnmadd213psH vfnmadd213ps:H vfnmadd213psH vfnmadd213ps;H vfnmadd213psH vfnmadd213ps9H vfnmadd213ps# vfnmadd213psH vfnmadd213ps/# vfnmadd213ps:H vfnmadd213ps# vfnmadd213psH vfnmadd213ps2# vfnmadd213ps;H vfnmadd213psH vfnmadd213psQH vfnmadd213psQHHhttps://www.felixcloutier.com/x86/vfnmadd132ps:vfnmadd213ps:vfnmadd231ps VPGATHERQD VPGATHERQD vpgatherqd=Gather Packed Doubleword Values Using Signed Quadword Indices vpgatherqdDH vpgatherqdHH vpgatherqdLH vpgatherqdD! vpgatherqdH!7https://www.felixcloutier.com/x86/vpgatherqd:vpgatherqqSETNASETNAsetna*Set byte if not above (CF == 1 or ZF == 1)setnaSETLS setnaSETLS#CMPPSCMPPScmpps5Compare Packed Single-Precision Floating-Point ValuescmppsCMPPScmppsCMPPS/'https://www.felixcloutier.com/x86/cmppsVCMPSDVCMPSDvcmpsd5Compare Scalar Double-Precision Floating-Point ValuesvcmpsdHvcmpsdHvcmpsd+Hvcmpsd+Hvcmpsd vcmpsd+ vcmpsdRHvcmpsdRHVHSUBPDVHSUBPDvhsubpd$Packed Double-FP Horizontal Subtractvhsubpd vhsubpd/ vhsubpd vhsubpd2 VPMINUDVPMINUDvpminud.Minimum of Packed Unsigned Doubleword Integersvpminud9HvpminudHvpminud:HvpminudHvpminud;HvpminudHvpminud9Hvpminud vpminudHvpminud/ vpminud:Hvpminud!vpminudHvpminud2!vpminud;HvpminudH VPHADDUDQ VPHADDUDQ vphaddudq5Packed Horizontal Add Unsigned Doubleword to Quadword vphaddudq" vphaddudq/"KANDNQKANDNQkandnq$Bitwise Logical AND NOT 64-bit MaskskandnqI=https://www.felixcloutier.com/x86/kandnw:kandnb:kandnq:kandnd VFMADD132PS VFMADD132PS vfmadd132psCFused Multiply-Add of Packed Single-Precision Floating-Point Values vfmadd132ps9H vfmadd132psH vfmadd132ps:H vfmadd132psH vfmadd132ps;H vfmadd132psH vfmadd132ps9H vfmadd132ps# vfmadd132psH vfmadd132ps/# vfmadd132ps:H vfmadd132ps# vfmadd132psH vfmadd132ps2# vfmadd132ps;H vfmadd132psH vfmadd132psQH vfmadd132psQHEhttps://www.felixcloutier.com/x86/vfmadd132ps:vfmadd213ps:vfmadd231ps VCVTTPD2UDQ VCVTTPD2UDQ vcvttpd2udqlConvert with Truncation Packed Double-Precision Floating-Point Values to Packed Unsigned Doubleword Integers vcvttpd2udqx=H vcvttpd2udqy?H vcvttpd2udqAH vcvttpd2udqxH vcvttpd2udqyH vcvttpd2udqH vcvttpd2udqx=H vcvttpd2udqy?H vcvttpd2udqxH vcvttpd2udqyH vcvttpd2udqAH vcvttpd2udqH vcvttpd2udqRH vcvttpd2udqRH-https://www.felixcloutier.com/x86/vcvttpd2udqCMOVGCMOVGcmovg&Move if greater (ZF == 0 and SF == OF)cmovgw  cmovgw $cmovglcmovgl'KORTESTBKORTESTBkortestbOR 8-bit Masks and Set FlagskortestbJEhttps://www.felixcloutier.com/x86/kortestw:kortestb:kortestq:kortestdPHSUBDPHSUBDphsubd.Packed Horizontal Subtract Doubleword Integersphsubdphsubd+phsubdphsubd//https://www.felixcloutier.com/x86/phsubw:phsubdBSFBSFbsfBit Scan ForwardbsfwBSFW  bsfwBSFW $bsflBSFLbsflBSFL'%https://www.felixcloutier.com/x86/bsf VFMSUB213SH VFMSUB213SH vfmsub213shFFused Multiply-Subtract of Scalar Half-Precision Floating-Point Values vfmsub213shR vfmsub213sh$R vfmsub213shR vfmsub213sh$R vfmsub213shQR vfmsub213shQRlhttps://www.felixcloutier.com/x86/vfmsub132sh:vfnmsub132sh:vfmsub213sh:vfnmsub213sh:vfmsub231sh:vfnmsub231shVSCATTERPF1QPSVSCATTERPF1QPSvscatterpf1qps‚Sparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Quadword Indices Using T1 Hint with Intent to Writevscatterpf1qpsML]https://www.felixcloutier.com/x86/vscatterpf1dps:vscatterpf1qps:vscatterpf1dpd:vscatterpf1qpd VGETMANTSD VGETMANTSD vgetmantsdMExtract Normalized Mantissa from Scalar Double-Precision Floating-Point Value vgetmantsdH vgetmantsd+H vgetmantsdH vgetmantsd+H vgetmantsdRH vgetmantsdRH,https://www.felixcloutier.com/x86/vgetmantsdKMOVBKMOVBkmovbMove 8-bit MaskkmovbJkmovbJkmovb#JkmovbJkmovb#J9https://www.felixcloutier.com/x86/kmovw:kmovb:kmovq:kmovdVPABSWVPABSWvpabsw&Packed Absolute Value of Word IntegersvpabswIvpabswIvpabswIvpabsw/Ivpabsw2Ivpabsw5Ivpabsw vpabswIvpabsw/ vpabsw/Ivpabsw!vpabswIvpabsw2!vpabsw2IvpabswIvpabsw5IEMMSEMMSemmsExit MMX StateemmsEMMS &https://www.felixcloutier.com/x86/emmsPSRLWPSRLWpsrlw$Shift Packed Word Data Right Logicalpsrlw psrlw psrlw+ psrlwpsrlwpsrlw/3https://www.felixcloutier.com/x86/psrlw:psrld:psrlq VCVTPH2UDQ VCVTPH2UDQ vcvtph2udq`Convert Packed Half-Precision Floating-Point Values to Packed Unsigned Doubleword Integer Values vcvtph2udq.K vcvtph2udq<K vcvtph2udq>R vcvtph2udqK vcvtph2udqK vcvtph2udqR vcvtph2udq.K vcvtph2udqK vcvtph2udq<K vcvtph2udqK vcvtph2udq>R vcvtph2udqR vcvtph2udqQR vcvtph2udqQR,https://www.felixcloutier.com/x86/vcvtph2udq VCVTPD2UDQ VCVTPD2UDQ vcvtpd2udq\Convert Packed Double-Precision Floating-Point Values to Packed Unsigned Doubleword Integers vcvtpd2udqx=H vcvtpd2udqy?H vcvtpd2udqAH vcvtpd2udqxH vcvtpd2udqyH vcvtpd2udqH vcvtpd2udqx=H vcvtpd2udqy?H vcvtpd2udqxH vcvtpd2udqyH vcvtpd2udqAH vcvtpd2udqH vcvtpd2udqQH vcvtpd2udqQH,https://www.felixcloutier.com/x86/vcvtpd2udq VCVTTPH2UW VCVTTPH2UW vcvttph2uwjConvert with Truncation Packed Half-Precision Floating-Point Values to Packed Unsigned Word Integer Values vcvttph2uw<K vcvttph2uw>K vcvttph2uw@R vcvttph2uwK vcvttph2uwK vcvttph2uwR vcvttph2uw<K vcvttph2uwK vcvttph2uw>K vcvttph2uwK vcvttph2uw@R vcvttph2uwR vcvttph2uwRR vcvttph2uwRR,https://www.felixcloutier.com/x86/vcvttph2uwVMOVDVMOVDvmovdMove Doublewordvmovd vmovdHvmovd vmovdHvmovd' vmovd'Hvmovd' vmovd'H VCVTUW2PH VCVTUW2PH vcvtuw2phTConvert Packed Unsigned Word Integers to Packed Half-Precision Floating-Point Values vcvtuw2ph<K vcvtuw2ph>K vcvtuw2ph@R vcvtuw2phK vcvtuw2phK vcvtuw2phR vcvtuw2ph<K vcvtuw2phK vcvtuw2ph>K vcvtuw2phK vcvtuw2ph@R vcvtuw2phR vcvtuw2phQR vcvtuw2phQR+https://www.felixcloutier.com/x86/vcvtuw2phVPHADDBDVPHADDBDvphaddbd6Packed Horizontal Add Signed Byte to Signed Doublewordvphaddbd"vphaddbd/"KORTESTQKORTESTQkortestqOR 64-bit Masks and Set FlagskortestqIEhttps://www.felixcloutier.com/x86/kortestw:kortestb:kortestq:kortestdCMOVNAECMOVNAEcmovnae$Move if not above or equal (CF == 1)cmovnaew  cmovnaew $cmovnaelcmovnael'PMOVSXBDPMOVSXBDpmovsxbdDMove Packed Byte Integers to Doubleword Integers with Sign Extensionpmovsxbdpmovsxbd'VADDSSVADDSSvaddss1Add Scalar Single-Precision Floating-Point ValuesvaddssHvaddss'Hvaddss vaddssHvaddss' vaddss'HvaddssQHvaddssQH VBLENDMPD VBLENDMPD vblendmpdLBlend Packed Double-Precision Floating-Point Vectors Using an OpMask Control  vblendmpd=H vblendmpdH vblendmpd?H vblendmpdH vblendmpdAH vblendmpdH vblendmpd=H vblendmpdH vblendmpd?H vblendmpdH vblendmpdAH vblendmpdH5https://www.felixcloutier.com/x86/vblendmpd:vblendmps VCVTTPD2UQQ VCVTTPD2UQQ vcvttpd2uqqjConvert with Truncation Packed Double-Precision Floating-Point Values to Packed Unsigned Quadword Integers vcvttpd2uqq=J vcvttpd2uqq?J vcvttpd2uqqAJ vcvttpd2uqqJ vcvttpd2uqqJ vcvttpd2uqqJ vcvttpd2uqq=J vcvttpd2uqqJ vcvttpd2uqq?J vcvttpd2uqqJ vcvttpd2uqqAJ vcvttpd2uqqJ vcvttpd2uqqRJ vcvttpd2uqqRJ-https://www.felixcloutier.com/x86/vcvttpd2uqq PUNPCKHDQ PUNPCKHDQ punpckhdq;Unpack and Interleave High-Order Doublewords into Quadwords punpckhdq  punpckhdq+  punpckhdq punpckhdq/Jhttps://www.felixcloutier.com/x86/punpckhbw:punpckhwd:punpckhdq:punpckhqdq VFMADDCSH VFMADDCSH vfmaddcshIFused Multiply-Add of Complex Scalar Half-Precision Floating-Point Values vfmaddcshR vfmaddcsh'R vfmaddcshR vfmaddcsh'R vfmaddcshQR vfmaddcshQR6https://www.felixcloutier.com/x86/vfcmaddcsh:vfmaddcshSETBSETBsetbSet byte if below (CF == 1)setbSETCS setbSETCS#ORPSORPSorps<Bitwise Logical OR of Single-Precision Floating-Point ValuesorpsORPSorpsORPS/&https://www.felixcloutier.com/x86/orpsPAVGBPAVGBpavgbAverage Packed Byte Integerspavgb pavgb+ pavgbpavgb/-https://www.felixcloutier.com/x86/pavgb:pavgw VGATHERDPS VGATHERDPS vgatherdpsTGather Packed Single-Precision Floating-Point Values Using Signed Doubleword Indices vgatherdpsBH vgatherdpsFH vgatherdpsJH vgatherdpsB! vgatherdpsF!7https://www.felixcloutier.com/x86/vgatherdps:vgatherdpdPMAXUWPMAXUWpmaxuw(Maximum of Packed Unsigned Word Integerspmaxuwpmaxuw//https://www.felixcloutier.com/x86/pmaxub:pmaxuwVPCMPGTDVPCMPGTDvpcmpgtd:Compare Packed Signed Doubleword Integers for Greater Thanvpcmpgtd9Hvpcmpgtd9HvpcmpgtdHvpcmpgtdHvpcmpgtd:Hvpcmpgtd:HvpcmpgtdHvpcmpgtdHvpcmpgtd;Hvpcmpgtd;HvpcmpgtdHvpcmpgtdHvpcmpgtd vpcmpgtd/ vpcmpgtd!vpcmpgtd2!VPMOVSDWVPMOVSDWvpmovsdwKDown Convert Packed Doubleword Values to Word Values with Signed Saturation vpmovsdwHvpmovsdw,HvpmovsdwHvpmovsdw0HvpmovsdwHvpmovsdw3HvpmovsdwHvpmovsdwHvpmovsdwHvpmovsdw+Hvpmovsdw/Hvpmovsdw2H<https://www.felixcloutier.com/x86/vpmovdw:vpmovsdw:vpmovusdw VPMOVSXBW VPMOVSXBW vpmovsxbw>Move Packed Byte Integers to Word Integers with Sign Extension vpmovsxbwI vpmovsxbwI vpmovsxbwI vpmovsxbw+I vpmovsxbw/I vpmovsxbw2I vpmovsxbw  vpmovsxbwI vpmovsxbw+  vpmovsxbw+I vpmovsxbw! vpmovsxbwI vpmovsxbw/! vpmovsxbw/I vpmovsxbwI vpmovsxbw2IVRCPSSVRCPSSvrcpssOCompute Approximate Reciprocal of Scalar Single-Precision Floating-Point Valuesvrcpss vrcpss' PABSBPABSBpabsb&Packed Absolute Value of Byte Integerspabsbpabsb+pabsbpabsb/9https://www.felixcloutier.com/x86/pabsb:pabsw:pabsd:pabsqCVTSI2SDCVTSI2SDcvtsi2sd9Convert Dword Integer to Scalar Double-Precision FP Value cvtsi2sdlCVTSL2SD cvtsi2sdlCVTSL2SD'*https://www.felixcloutier.com/x86/cvtsi2sdJNAEJNAEjnae$Jump if not above or equal (CF == 1)jnaeJCSNjnaeJCSOVROUNDPDVROUNDPDvroundpd3Round Packed Double Precision Floating-Point Valuesvroundpd vroundpd/ vroundpd vroundpd2 PSHUFWPSHUFWpshufwShuffle Packed Wordspshufw pshufw+ (https://www.felixcloutier.com/x86/pshufw VPUNPCKHDQ VPUNPCKHDQ vpunpckhdq;Unpack and Interleave High-Order Doublewords into Quadwords vpunpckhdq9H vpunpckhdqH vpunpckhdq:H vpunpckhdqH vpunpckhdq;H vpunpckhdqH vpunpckhdq9H vpunpckhdq  vpunpckhdqH vpunpckhdq/  vpunpckhdq:H vpunpckhdq! vpunpckhdqH vpunpckhdq2! vpunpckhdq;H vpunpckhdqHVPMAXUWVPMAXUWvpmaxuw(Maximum of Packed Unsigned Word IntegersvpmaxuwIvpmaxuw/IvpmaxuwIvpmaxuw2IvpmaxuwIvpmaxuw5Ivpmaxuw vpmaxuwIvpmaxuw/ vpmaxuw/Ivpmaxuw!vpmaxuwIvpmaxuw2!vpmaxuw2IvpmaxuwIvpmaxuw5I VCVTUDQ2PS VCVTUDQ2PS vcvtudq2ps\Convert Packed Unsigned Doubleword Integers to Packed Single-Precision Floating-Point Values vcvtudq2ps9H vcvtudq2ps:H vcvtudq2ps;H vcvtudq2psH vcvtudq2psH vcvtudq2psH vcvtudq2ps9H vcvtudq2psH vcvtudq2ps:H vcvtudq2psH vcvtudq2ps;H vcvtudq2psH vcvtudq2psQH vcvtudq2psQH,https://www.felixcloutier.com/x86/vcvtudq2ps SHA1NEXTE SHA1NEXTE sha1nexte1Calculate SHA1 State Variable E after Four Rounds sha1nexte( sha1nexte/(+https://www.felixcloutier.com/x86/sha1nexteVFMADDSUB231PDVFMADDSUB231PDvfmaddsub231pdXFused Multiply-Alternating Add/Subtract of Packed Double-Precision Floating-Point Valuesvfmaddsub231pd=Hvfmaddsub231pdHvfmaddsub231pd?Hvfmaddsub231pdHvfmaddsub231pdAHvfmaddsub231pdHvfmaddsub231pd=Hvfmaddsub231pd#vfmaddsub231pdHvfmaddsub231pd/#vfmaddsub231pd?Hvfmaddsub231pd#vfmaddsub231pdHvfmaddsub231pd2#vfmaddsub231pdAHvfmaddsub231pdHvfmaddsub231pdQHvfmaddsub231pdQHNhttps://www.felixcloutier.com/x86/vfmaddsub132pd:vfmaddsub213pd:vfmaddsub231pd VMASKMOVPS VMASKMOVPS vmaskmovps>Conditional Move Packed Single-Precision Floating-Point Values vmaskmovps/  vmaskmovps2  vmaskmovps/  vmaskmovps2  VPMOVSXBQ VPMOVSXBQ vpmovsxbqBMove Packed Byte Integers to Quadword Integers with Sign Extension vpmovsxbqH vpmovsxbqH vpmovsxbqH vpmovsxbq$H vpmovsxbq'H vpmovsxbq+H vpmovsxbq  vpmovsxbqH vpmovsxbq$  vpmovsxbq$H vpmovsxbq! vpmovsxbqH vpmovsxbq'! vpmovsxbq'H vpmovsxbqH vpmovsxbq+H VSM4RNDS4 VSM4RNDS4 vsm4rnds4&Performs Four Rounds of SM4 Encryption vsm4rnds4 vsm4rnds4/ vsm4rnds4 vsm4rnds42KSHIFTRBKSHIFTRBkshiftrbShift Right 8-bit MaskskshiftrbJEhttps://www.felixcloutier.com/x86/kshiftrw:kshiftrb:kshiftrq:kshiftrdRDSEEDRDSEEDrdseedRead Random SEEDrdseed +rdseed+(https://www.felixcloutier.com/x86/rdseedMOVMSKPSMOVMSKPSmovmskps8Extract Packed Single-Precision Floating-Point Sign MaskmovmskpsMOVMSKPS*https://www.felixcloutier.com/x86/movmskpsPFMULPFMULpfmulPacked Floating-Point Multiplypfmulpfmul+ SHA256MSG2 SHA256MSG2 sha256msg2HPerform a Final Calculation for the Next Four SHA256 Message Doublewords sha256msg2( sha256msg2/(,https://www.felixcloutier.com/x86/sha256msg2UNPCKHPSUNPCKHPSunpckhpsHUnpack and Interleave High Packed Single-Precision Floating-Point ValuesunpckhpsUNPCKHPSunpckhpsUNPCKHPS/*https://www.felixcloutier.com/x86/unpckhpsRETRETretReturn from ProcedureretlRETretl%https://www.felixcloutier.com/x86/ret VCVTPS2PHX VCVTPS2PHX vcvtps2phx<Convert Single-Precision FP value to Half-Precision FP value vcvtps2phxx9K vcvtps2phxy:K vcvtps2phx;R vcvtps2phxxK vcvtps2phxyK vcvtps2phxR vcvtps2phxx9K vcvtps2phxy:K vcvtps2phxxK vcvtps2phxyK vcvtps2phx;R vcvtps2phxR vcvtps2phxQR vcvtps2phxQR,https://www.felixcloutier.com/x86/vcvtps2phxVPSRLDQVPSRLDQvpsrldq*Shift Packed Double Quadword Right Logicalvpsrldq vpsrldqIvpsrldq/Ivpsrldq!vpsrldqIvpsrldq2IvpsrldqIvpsrldq5IPBLENDVBPBLENDVBpblendvbVariable Blend Packed Bytespblendvbpblendvb/*https://www.felixcloutier.com/x86/pblendvb VFNMADD132SS VFNMADD132SS vfnmadd132ssLFused Negative Multiply-Add of Scalar Single-Precision Floating-Point Values vfnmadd132ssH vfnmadd132ss'H vfnmadd132ss# vfnmadd132ssH vfnmadd132ss'# vfnmadd132ss'H vfnmadd132ssQH vfnmadd132ssQHHhttps://www.felixcloutier.com/x86/vfnmadd132ss:vfnmadd213ss:vfnmadd231ssVPMULHUWVPMULHUWvpmulhuw<Multiply Packed Unsigned Word Integers and Store High ResultvpmulhuwIvpmulhuw/IvpmulhuwIvpmulhuw2IvpmulhuwIvpmulhuw5Ivpmulhuw vpmulhuwIvpmulhuw/ vpmulhuw/Ivpmulhuw!vpmulhuwIvpmulhuw2!vpmulhuw2IvpmulhuwIvpmulhuw5IVADDPSVADDPSvaddps1Add Packed Single-Precision Floating-Point Valuesvaddps9HvaddpsHvaddps:HvaddpsHvaddps;HvaddpsHvaddps9Hvaddps vaddpsHvaddps/ vaddps:Hvaddps vaddpsHvaddps2 vaddps;HvaddpsHvaddpsQHvaddpsQH VRNDSCALESD VRNDSCALESD vrndscalesd]Round Scalar Double-Precision Floating-Point Value To Include A Given Number Of Fraction Bits vrndscalesdH vrndscalesd+H vrndscalesdH vrndscalesd+H vrndscalesdRH vrndscalesdRH-https://www.felixcloutier.com/x86/vrndscalesdNOTNOTnotOne's Complement NegationnotbNOTB notwNOTW notlNOTLnotbNOTB#notwNOTW$notlNOTL'%https://www.felixcloutier.com/x86/notPMINUDPMINUDpminud.Minimum of Packed Unsigned Doubleword Integerspminudpminud//https://www.felixcloutier.com/x86/pminud:pminuqEXTRQEXTRQextrq Extract FieldextrqextrqPCMPGTWPCMPGTWpcmpgtw4Compare Packed Signed Word Integers for Greater Thanpcmpgtw pcmpgtw+ pcmpgtwpcmpgtw/9https://www.felixcloutier.com/x86/pcmpgtb:pcmpgtw:pcmpgtd VMOVMSKPS VMOVMSKPS vmovmskps8Extract Packed Single-Precision Floating-Point Sign Mask vmovmskps  vmovmskps  VPMASKMOVD VPMASKMOVD vpmaskmovd+Conditional Move Packed Doubleword Integers vpmaskmovd/! vpmaskmovd2! vpmaskmovd/! vpmaskmovd2!VPSRAVQVPSRAVQvpsravq4Variable Shift Packed Quadword Data Right Arithmetic vpsravq=HvpsravqHvpsravq?HvpsravqHvpsravqAHvpsravqHvpsravq=HvpsravqHvpsravq?HvpsravqHvpsravqAHvpsravqH9https://www.felixcloutier.com/x86/vpsravw:vpsravd:vpsravqVPLZCNTDVPLZCNTDvplzcntdBCount the Number of Leading Zero Bits for Packed Doubleword Values vplzcntd9Nvplzcntd:Nvplzcntd;NvplzcntdNvplzcntdNvplzcntdNvplzcntd9NvplzcntdNvplzcntd:NvplzcntdNvplzcntd;NvplzcntdN3https://www.felixcloutier.com/x86/vplzcntd:vplzcntq VREDUCEPH VREDUCEPH vreducephOPerform Reduction Transformation on Packed Half-Precision Floating-Point Values vreduceph<K vreduceph>K vreduceph@R vreducephK vreducephK vreducephR vreduceph<K vreducephK vreduceph>K vreducephK vreduceph@R vreducephR vreducephRR vreducephRR+https://www.felixcloutier.com/x86/vreducephKMOVDKMOVDkmovdMove 32-bit MaskkmovdIkmovdIkmovd'IkmovdIkmovd'I9https://www.felixcloutier.com/x86/kmovw:kmovb:kmovq:kmovdVPADDSWVPADDSWvpaddsw6Add Packed Signed Word Integers with Signed SaturationvpaddswIvpaddsw/IvpaddswIvpaddsw2IvpaddswIvpaddsw5Ivpaddsw vpaddswIvpaddsw/ vpaddsw/Ivpaddsw!vpaddswIvpaddsw2!vpaddsw2IvpaddswIvpaddsw5IVPMAXUBVPMAXUBvpmaxub(Maximum of Packed Unsigned Byte IntegersvpmaxubIvpmaxub/IvpmaxubIvpmaxub2IvpmaxubIvpmaxub5Ivpmaxub vpmaxubIvpmaxub/ vpmaxub/Ivpmaxub!vpmaxubIvpmaxub2!vpmaxub2IvpmaxubIvpmaxub5IVCMPPHVCMPPHvcmpph3Compare Packed Half-Precision Floating-Point Valuesvcmpph<Kvcmpph<KvcmpphKvcmpphKvcmpph>Kvcmpph>KvcmpphKvcmpphKvcmpph@Rvcmpph@RvcmpphRvcmpphRvcmpphRRvcmpphRR(https://www.felixcloutier.com/x86/vcmpphCVTSD2SSCVTSD2SScvtsd2ssLConvert Scalar Double-Precision FP Value to Scalar Single-Precision FP Valuecvtsd2ssCVTSD2SScvtsd2ssCVTSD2SS+*https://www.felixcloutier.com/x86/cvtsd2ssBTBTbtBit TestbtwBTW btwBTW  btlBTLbtlBTLbtwBTW$btwBTW$ btlBTL'btlBTL'$https://www.felixcloutier.com/x86/btCMOVNECMOVNEcmovneMove if not equal (ZF == 0)cmovnew  cmovnew $cmovnelcmovnel' VCVTPD2QQ VCVTPD2QQ vcvtpd2qqQConvert Packed Double-Precision Floating-Point Values to Packed Quadword Integers vcvtpd2qq=J vcvtpd2qq?J vcvtpd2qqAJ vcvtpd2qqJ vcvtpd2qqJ vcvtpd2qqJ vcvtpd2qq=J vcvtpd2qqJ vcvtpd2qq?J vcvtpd2qqJ vcvtpd2qqAJ vcvtpd2qqJ vcvtpd2qqQJ vcvtpd2qqQJ+https://www.felixcloutier.com/x86/vcvtpd2qq VFCMADDCPH VFCMADDCPH vfcmaddcphSFused Conjugate Multiply-Add of Complex Packed Half-Precision Floating-Point Values vfcmaddcph9K vfcmaddcphK vfcmaddcph:K vfcmaddcphK vfcmaddcph;R vfcmaddcphR vfcmaddcph9K vfcmaddcphK vfcmaddcph:K vfcmaddcphK vfcmaddcph;R vfcmaddcphR vfcmaddcphQR vfcmaddcphQR6https://www.felixcloutier.com/x86/vfcmaddcph:vfmaddcph VMOVMSKPD VMOVMSKPD vmovmskpd8Extract Packed Double-Precision Floating-Point Sign Mask vmovmskpd  vmovmskpd  VREDUCEPD VREDUCEPD vreducepdQPerform Reduction Transformation on Packed Double-Precision Floating-Point Values  vreducepd=J vreducepd?J vreducepdAJ vreducepdJ vreducepdJ vreducepdJ vreducepd=J vreducepdJ vreducepd?J vreducepdJ vreducepdAJ vreducepdJ+https://www.felixcloutier.com/x86/vreducepdVPSLLVQVPSLLVQvpsllvq0Variable Shift Packed Quadword Data Left Logicalvpsllvq=HvpsllvqHvpsllvq?HvpsllvqHvpsllvqAHvpsllvqHvpsllvq=Hvpsllvq!vpsllvqHvpsllvq/!vpsllvq?Hvpsllvq!vpsllvqHvpsllvq2!vpsllvqAHvpsllvqH9https://www.felixcloutier.com/x86/vpsllvw:vpsllvd:vpsllvqVRSQRTSHVRSQRTSHvrsqrtshOCompute Reciprocal of Square Root of Scalar Half-Precision Floating-Point ValuevrsqrtshRvrsqrtsh$RvrsqrtshRvrsqrtsh$R*https://www.felixcloutier.com/x86/vrsqrtsh VPUNPCKLQDQ VPUNPCKLQDQ vpunpcklqdq?Unpack and Interleave Low-Order Quadwords into Double Quadwords vpunpcklqdq=H vpunpcklqdqH vpunpcklqdq?H vpunpcklqdqH vpunpcklqdqAH vpunpcklqdqH vpunpcklqdq=H vpunpcklqdq  vpunpcklqdqH vpunpcklqdq/  vpunpcklqdq?H vpunpcklqdq! vpunpcklqdqH vpunpcklqdq2! vpunpcklqdqAH vpunpcklqdqHVANDNPSVANDNPSvandnpsHBitwise Logical AND NOT of Packed Single-Precision Floating-Point Valuesvandnps9JvandnpsJvandnps:JvandnpsJvandnps;JvandnpsJvandnps9Jvandnps vandnpsJvandnps/ vandnps:Jvandnps vandnpsJvandnps2 vandnps;JvandnpsJ VSCALEFSD VSCALEFSD vscalefsd_Scale Scalar Double-Precision Floating-Point Value With a Double-Precision Floating-Point Value vscalefsdH vscalefsd+H vscalefsdH vscalefsd+H vscalefsdQH vscalefsdQH+https://www.felixcloutier.com/x86/vscalefsdXGETBVXGETBVxgetbv&Get Value of Extended Control Registerxgetbv(https://www.felixcloutier.com/x86/xgetbvADCXADCXadcx9Unsigned Integer Addition of Two Operands with Carry Flagadcxl7adcxl'7&https://www.felixcloutier.com/x86/adcxXORXORxorLogical Exclusive ORxorbXORBxorbXORB xorbXORB  xorbXORB #xorwXORW xorwXORW xorwXORW xorwXORW  xorwXORW $xorlXORLxorlXORLxorlXORLxorlXORLxorlXORL'xorbXORB#xorbXORB# xorwXORW$xorwXORW$xorwXORW$ xorlXORL'xorlXORL'xorlXORL'%https://www.felixcloutier.com/x86/xorANDPDANDPDandpdDBitwise Logical AND of Packed Double-Precision Floating-Point ValuesandpdANDPDandpdANDPD/'https://www.felixcloutier.com/x86/andpdVFMSUBADD231PDVFMSUBADD231PDvfmsubadd231pdXFused Multiply-Alternating Subtract/Add of Packed Double-Precision Floating-Point Valuesvfmsubadd231pd=Hvfmsubadd231pdHvfmsubadd231pd?Hvfmsubadd231pdHvfmsubadd231pdAHvfmsubadd231pdHvfmsubadd231pd=Hvfmsubadd231pd#vfmsubadd231pdHvfmsubadd231pd/#vfmsubadd231pd?Hvfmsubadd231pd#vfmsubadd231pdHvfmsubadd231pd2#vfmsubadd231pdAHvfmsubadd231pdHvfmsubadd231pdQHvfmsubadd231pdQHNhttps://www.felixcloutier.com/x86/vfmsubadd132pd:vfmsubadd213pd:vfmsubadd231pd VGATHERPF1QPS VGATHERPF1QPS vgatherpf1qpsmSparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Quadword Indices Using T1 Hint vgatherpf1qpsMLYhttps://www.felixcloutier.com/x86/vgatherpf1dps:vgatherpf1qps:vgatherpf1dpd:vgatherpf1qpdCMPSDCMPSDcmpsd5Compare Scalar Double-Precision Floating-Point ValuescmpsdCMPSDcmpsdCMPSD+'https://www.felixcloutier.com/x86/cmpsdVCMPPDVCMPPDvcmppd5Compare Packed Double-Precision Floating-Point Valuesvcmppd=Hvcmppd=HvcmppdHvcmppdHvcmppd?Hvcmppd?HvcmppdHvcmppdHvcmppdAHvcmppdAHvcmppdHvcmppdHvcmppd vcmppd/ vcmppd vcmppd2 vcmppdRHvcmppdRH VFNMSUB132SS VFNMSUB132SS vfnmsub132ssQFused Negative Multiply-Subtract of Scalar Single-Precision Floating-Point Values vfnmsub132ssH vfnmsub132ss'H vfnmsub132ss# vfnmsub132ssH vfnmsub132ss'# vfnmsub132ss'H vfnmsub132ssQH vfnmsub132ssQHHhttps://www.felixcloutier.com/x86/vfnmsub132ss:vfnmsub213ss:vfnmsub231ssSETPOSETPOsetpo Set byte if parity odd (PF == 0)setpoSETPC setpoSETPC# VBCSTNESH2PS VBCSTNESH2PS vbcstnesh2ps<Load FP16 Element and Convert to FP32 Element with Broadcast vbcstnesh2ps$Z vbcstnesh2ps$ZVPMINSDVPMINSDvpminsd,Minimum of Packed Signed Doubleword Integersvpminsd9HvpminsdHvpminsd:HvpminsdHvpminsd;HvpminsdHvpminsd9Hvpminsd vpminsdHvpminsd/ vpminsd:Hvpminsd!vpminsdHvpminsd2!vpminsd;HvpminsdHKANDDKANDDkandd Bitwise Logical AND 32-bit MaskskanddI9https://www.felixcloutier.com/x86/kandw:kandb:kandq:kanddVPMOVQDVPMOVQDvpmovqdHDown Convert Packed Quadword Values to Doubleword Values with Truncation vpmovqdHvpmovqd,HvpmovqdHvpmovqd0HvpmovqdHvpmovqd3HvpmovqdHvpmovqdHvpmovqdHvpmovqd+Hvpmovqd/Hvpmovqd2H<https://www.felixcloutier.com/x86/vpmovqd:vpmovsqd:vpmovusqd VSHUFI64X2 VSHUFI64X2 vshufi64x2.Shuffle 128-Bit Packed Quadword Integer Values vshufi64x2?H vshufi64x2H vshufi64x2AH vshufi64x2H vshufi64x2?H vshufi64x2H vshufi64x2AH vshufi64x2HMOVNTSSMOVNTSSmovntssKStore Scalar Single-Precision Floating-Point Values Using Non-Temporal Hintmovntss' VFMADD231PD VFMADD231PD vfmadd231pdCFused Multiply-Add of Packed Double-Precision Floating-Point Values vfmadd231pd=H vfmadd231pdH vfmadd231pd?H vfmadd231pdH vfmadd231pdAH vfmadd231pdH vfmadd231pd=H vfmadd231pd# vfmadd231pdH vfmadd231pd/# vfmadd231pd?H vfmadd231pd# vfmadd231pdH vfmadd231pd2# vfmadd231pdAH vfmadd231pdH vfmadd231pdQH vfmadd231pdQHEhttps://www.felixcloutier.com/x86/vfmadd132pd:vfmadd213pd:vfmadd231pdVMINSHVMINSHvminsh9Return Minimum Scalar Half-Precision Floating-Point ValuevminshRvminsh$RvminshRvminsh$RvminshRRvminshRR(https://www.felixcloutier.com/x86/vminshVSCATTERPF1QPDVSCATTERPF1QPDvscatterpf1qpd‚Sparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Quadword Indices Using T1 Hint with Intent to Writevscatterpf1qpdML]https://www.felixcloutier.com/x86/vscatterpf1dps:vscatterpf1qps:vscatterpf1dpd:vscatterpf1qpdMOVHPDMOVHPDmovhpd6Move High Packed Double-Precision Floating-Point ValuemovhpdMOVHPD+movhpdMOVHPD+(https://www.felixcloutier.com/x86/movhpdAESIMCAESIMCaesimc+Perform the AES InvMixColumn Transformationaesimc'aesimc/'(https://www.felixcloutier.com/x86/aesimcMOVNTPSMOVNTPSmovntpsKStore Packed Single-Precision Floating-Point Values Using Non-Temporal HintmovntpsMOVNTPS/)https://www.felixcloutier.com/x86/movntpsSETGSETGsetg*Set byte if greater (ZF == 0 and SF == OF)setgSETGT setgSETGT#SHA1MSG2SHA1MSG2sha1msg2FPerform a Final Calculation for the Next Four SHA1 Message Doublewordssha1msg2(sha1msg2/(*https://www.felixcloutier.com/x86/sha1msg2 VGETMANTPH VGETMANTPH vgetmantphMExtract Normalized Mantissas from Packed Half-Precision Floating-Point Values vgetmantph<K vgetmantph>K vgetmantph@R vgetmantphK vgetmantphK vgetmantphR vgetmantph<K vgetmantphK vgetmantph>K vgetmantphK vgetmantph@R vgetmantphR vgetmantphRR vgetmantphRR,https://www.felixcloutier.com/x86/vgetmantphVPSRLVDVPSRLVDvpsrlvd3Variable Shift Packed Doubleword Data Right Logicalvpsrlvd9HvpsrlvdHvpsrlvd:HvpsrlvdHvpsrlvd;HvpsrlvdHvpsrlvd9Hvpsrlvd!vpsrlvdHvpsrlvd/!vpsrlvd:Hvpsrlvd!vpsrlvdHvpsrlvd2!vpsrlvd;HvpsrlvdH9https://www.felixcloutier.com/x86/vpsrlvw:vpsrlvd:vpsrlvqAESDECAESDECaesdec+Perform One Round of an AES Decryption Flowaesdec'aesdec/'(https://www.felixcloutier.com/x86/aesdecVXORPDVXORPDvxorpd>Bitwise Logical XOR for Double-Precision Floating-Point Valuesvxorpd=JvxorpdJvxorpd?JvxorpdJvxorpdAJvxorpdJvxorpd=Jvxorpd vxorpdJvxorpd/ vxorpd?Jvxorpd vxorpdJvxorpd2 vxorpdAJvxorpdJVDIVSSVDIVSSvdivss4Divide Scalar Single-Precision Floating-Point ValuesvdivssHvdivss'Hvdivss vdivssHvdivss' vdivss'HvdivssQHvdivssQHLDDQULDDQUlddquLoad Unaligned Integer 128 Bitslddqu/'https://www.felixcloutier.com/x86/lddqu VPMOVUSQD VPMOVUSQD vpmovusqdQDown Convert Packed Quadword Values to Doubleword Values with Unsigned Saturation  vpmovusqdH vpmovusqd,H vpmovusqdH vpmovusqd0H vpmovusqdH vpmovusqd3H vpmovusqdH vpmovusqdH vpmovusqdH vpmovusqd+H vpmovusqd/H vpmovusqd2H<https://www.felixcloutier.com/x86/vpmovqd:vpmovsqd:vpmovusqdPMINSWPMINSWpminsw&Minimum of Packed Signed Word IntegerspminswPMINSW pminswPMINSW+ pminswPMINSWpminswPMINSW//https://www.felixcloutier.com/x86/pminsb:pminswPSWAPDPSWAPDpswapdPacked Swap Doublewordpswapdpswapd+ VEXTRACTI128 VEXTRACTI128 vextracti128Extract Packed Integer Values vextracti128! vextracti128/!fhttps://www.felixcloutier.com/x86/vextracti128:vextracti32x4:vextracti64x2:vextracti32x8:vextracti64x4VDIVPSVDIVPSvdivps4Divide Packed Single-Precision Floating-Point Valuesvdivps9HvdivpsHvdivps:HvdivpsHvdivps;HvdivpsHvdivps9Hvdivps vdivpsHvdivps/ vdivps:Hvdivps vdivpsHvdivps2 vdivps;HvdivpsHvdivpsQHvdivpsQH VINSERTI64X4 VINSERTI64X4 vinserti64x41Insert 256 Bits of Packed Quadword Integer Values vinserti64x4H vinserti64x42H vinserti64x4H vinserti64x42HJNGEJNGEjnge'Jump if not greater or equal (SF != OF)jngeJLTNjngeJLTO VPMOVSXWD VPMOVSXWD vpmovsxwdDMove Packed Word Integers to Doubleword Integers with Sign Extension vpmovsxwdH vpmovsxwdH vpmovsxwdH vpmovsxwd+H vpmovsxwd/H vpmovsxwd2H vpmovsxwd  vpmovsxwdH vpmovsxwd+  vpmovsxwd+H vpmovsxwd! vpmovsxwdH vpmovsxwd/! vpmovsxwd/H vpmovsxwdH vpmovsxwd2HSHLXSHLXshlx*Logical Shift Left Without Affecting Flagsshlxl5shlxl'50https://www.felixcloutier.com/x86/sarx:shlx:shrxVMULSSVMULSSvmulss6Multiply Scalar Single-Precision Floating-Point ValuesvmulssHvmulss'Hvmulss vmulssHvmulss' vmulss'HvmulssQHvmulssQHVTESTPDVTESTPDvtestpd/Packed Double-Precision Floating-Point Bit Testvtestpd vtestpd/ vtestpd vtestpd2 1https://www.felixcloutier.com/x86/vtestpd:vtestpsPFCMPEQPFCMPEQpfcmpeq'Packed Floating-Point Compare for Equalpfcmpeqpfcmpeq+VPSHRDVDVPSHRDVDvpshrdvdCConcatenate and Variable Shift Packed Doubleword Data Right Logical vpshrdvd9KvpshrdvdKvpshrdvd:KvpshrdvdKvpshrdvd;UvpshrdvdUvpshrdvd9KvpshrdvdKvpshrdvd:KvpshrdvdKvpshrdvd;UvpshrdvdUPCMPEQBPCMPEQBpcmpeqb%Compare Packed Byte Data for EqualitypcmpeqbPCMPEQB pcmpeqbPCMPEQB+ pcmpeqbPCMPEQBpcmpeqbPCMPEQB/9https://www.felixcloutier.com/x86/pcmpeqb:pcmpeqw:pcmpeqd VPUNPCKLWD VPUNPCKLWD vpunpcklwd6Unpack and Interleave Low-Order Words into Doublewords vpunpcklwdI vpunpcklwd/I vpunpcklwdI vpunpcklwd2I vpunpcklwdI vpunpcklwd5I vpunpcklwd  vpunpcklwdI vpunpcklwd/  vpunpcklwd/I vpunpcklwd! vpunpcklwdI vpunpcklwd2! vpunpcklwd2I vpunpcklwdI vpunpcklwd5I VFNMSUB231SH VFNMSUB231SH vfnmsub231shOFused Negative Multiply-Subtract of Scalar Half-Precision Floating-Point Values vfnmsub231shR vfnmsub231sh$R vfnmsub231shR vfnmsub231sh$R vfnmsub231shQR vfnmsub231shQRlhttps://www.felixcloutier.com/x86/vfmsub132sh:vfnmsub132sh:vfmsub213sh:vfnmsub213sh:vfmsub231sh:vfnmsub231sh VFIXUPIMMPD VFIXUPIMMPD vfixupimmpd<Fix Up Special Packed Double-Precision Floating-Point Values vfixupimmpd=H vfixupimmpdH vfixupimmpd?H vfixupimmpdH vfixupimmpdAH vfixupimmpdH vfixupimmpd=H vfixupimmpdH vfixupimmpd?H vfixupimmpdH vfixupimmpdAH vfixupimmpdH vfixupimmpdRH vfixupimmpdRH-https://www.felixcloutier.com/x86/vfixupimmpdKSHIFTLWKSHIFTLWkshiftlwShift Left 16-bit MaskskshiftlwHEhttps://www.felixcloutier.com/x86/kshiftlw:kshiftlb:kshiftlq:kshiftldPSLLDQPSLLDQpslldq)Shift Packed Double Quadword Left Logicalpslldq(https://www.felixcloutier.com/x86/pslldqMOVMSKPDMOVMSKPDmovmskpd8Extract Packed Double-Precision Floating-Point Sign MaskmovmskpdMOVMSKPD*https://www.felixcloutier.com/x86/movmskpdVRANGESDVRANGESDvrangesdYRange Restriction Calculation For a pair of Scalar Double-Precision Floating-Point ValuesvrangesdJvrangesd+JvrangesdJvrangesd+JvrangesdRJvrangesdRJ*https://www.felixcloutier.com/x86/vrangesdVRCP14PSVRCP14PSvrcp14psPCompute Approximate Reciprocals of Packed Single-Precision Floating-Point Values vrcp14ps9Hvrcp14ps:Hvrcp14ps;Hvrcp14psHvrcp14psHvrcp14psHvrcp14ps9Hvrcp14psHvrcp14ps:Hvrcp14psHvrcp14ps;Hvrcp14psH*https://www.felixcloutier.com/x86/vrcp14psCWDECWDEcwdeConvert Word to Doublewordcwtl/https://www.felixcloutier.com/x86/cbw:cwde:cdqeVROUNDSSVROUNDSSvroundss3Round Scalar Single Precision Floating-Point Valuesvroundss vroundss'  VCVTPH2UQQ VCVTPH2UQQ vcvtph2uqq^Convert Packed Half Precision Floating-Point Values to Packed Unsigned Quadword Integer Values vcvtph2uqq*K vcvtph2uqq.K vcvtph2uqq<R vcvtph2uqqK vcvtph2uqqK vcvtph2uqqR vcvtph2uqq*K vcvtph2uqqK vcvtph2uqq.K vcvtph2uqqK vcvtph2uqq<R vcvtph2uqqR vcvtph2uqqQR vcvtph2uqqQR,https://www.felixcloutier.com/x86/vcvtph2uqqVPXORQVPXORQvpxorq8Bitwise Logical Exclusive OR of Packed Quadword Integers vpxorq=HvpxorqHvpxorq?HvpxorqHvpxorqAHvpxorqHvpxorq=HvpxorqHvpxorq?HvpxorqHvpxorqAHvpxorqHMONITORXMONITORXmonitorx+Monitor a Linear Address Range with TimeoutmonitorxEMOVMOVmovMovemovbMOVB movbMOVB  movbMOVB #movwMOVW movwMOVW  movwMOVW $movabsl movlMOVLmovlMOVLmovlMOVL'movbMOVB#movbMOVB# movwMOVW$movwMOVW$ movlMOVL'movlMOVL'movabsl 'https://www.felixcloutier.com/x86/mov-2DIVSSDIVSSdivss4Divide Scalar Single-Precision Floating-Point ValuesdivssDIVSSdivssDIVSS''https://www.felixcloutier.com/x86/divssKORWKORWkorwBitwise Logical OR 16-bit MaskskorwH5https://www.felixcloutier.com/x86/korw:korb:korq:kordPMAXUDPMAXUDpmaxud.Maximum of Packed Unsigned Doubleword Integerspmaxudpmaxud//https://www.felixcloutier.com/x86/pmaxud:pmaxuqPMOVZXWDPMOVZXWDpmovzxwdDMove Packed Word Integers to Doubleword Integers with Zero Extensionpmovzxwdpmovzxwd+SETNLESETNLEsetnle4Set byte if not less or equal (ZF == 0 and SF == OF)setnleSETGT setnleSETGT#CLDCLDcldClear Direction FlagcldCLD%https://www.felixcloutier.com/x86/cldSUBSSSUBSSsubss6Subtract Scalar Single-Precision Floating-Point ValuessubssSUBSSsubssSUBSS''https://www.felixcloutier.com/x86/subssVHADDPSVHADDPSvhaddpsPacked Single-FP Horizontal Addvhaddps vhaddps/ vhaddps vhaddps2 AADDAADDaaddAtomically ADDaadd' CMPXCHG8B CMPXCHG8B cmpxchg8bCompare and Exchange 8 Bytes cmpxchg8b CMPXCHG8B+ 6https://www.felixcloutier.com/x86/cmpxchg8b:cmpxchg16bJNEJNEjneJump if not equal (ZF == 0)jneJNENjneJNEO VFMADD213PD VFMADD213PD vfmadd213pdCFused Multiply-Add of Packed Double-Precision Floating-Point Values vfmadd213pd=H vfmadd213pdH vfmadd213pd?H vfmadd213pdH vfmadd213pdAH vfmadd213pdH vfmadd213pd=H vfmadd213pd# vfmadd213pdH vfmadd213pd/# vfmadd213pd?H vfmadd213pd# vfmadd213pdH vfmadd213pd2# vfmadd213pdAH vfmadd213pdH vfmadd213pdQH vfmadd213pdQHEhttps://www.felixcloutier.com/x86/vfmadd132pd:vfmadd213pd:vfmadd231pdSARSARsarArithmetic Shift RightsarbSARB sarbSARB sarbSARB sarwSARW sarwSARW sarwSARW sarlSARLsarlSARLsarlSARLsarbSARB#sarbSARB#sarbSARB#sarwSARW$sarwSARW$sarwSARW$sarlSARL'sarlSARL'sarlSARL'1https://www.felixcloutier.com/x86/sal:sar:shl:shr VMASKMOVPD VMASKMOVPD vmaskmovpd>Conditional Move Packed Double-Precision Floating-Point Values vmaskmovpd/  vmaskmovpd2  vmaskmovpd/  vmaskmovpd2  VPCMPISTRM VPCMPISTRM vpcmpistrm3Packed Compare Implicit Length Strings, Return Mask vpcmpistrm  vpcmpistrm/ CVTPI2PDCVTPI2PDcvtpi2pdBConvert Packed Dword Integers to Packed Double-Precision FP Valuescvtpi2pdCVTPL2PDcvtpi2pdCVTPL2PD+*https://www.felixcloutier.com/x86/cvtpi2pdJNOJNOjnoJump if not overflow (OF == 0)jnoJOCNjnoJOCOPMOVZXBWPMOVZXBWpmovzxbw>Move Packed Byte Integers to Word Integers with Zero Extensionpmovzxbwpmovzxbw+VMOVDQU8VMOVDQU8vmovdqu8Move Unaligned Byte Valuesvmovdqu80Ivmovdqu8Ivmovdqu83Ivmovdqu8Ivmovdqu86Ivmovdqu8Ivmovdqu8/Ivmovdqu82Ivmovdqu85Ivmovdqu8Ivmovdqu8/Ivmovdqu8Ivmovdqu82Ivmovdqu8Ivmovdqu85Ivmovdqu8/Ivmovdqu82Ivmovdqu85IOhttps://www.felixcloutier.com/x86/movdqu:vmovdqu8:vmovdqu16:vmovdqu32:vmovdqu64 VPERMT2PS VPERMT2PS vpermt2psZFull Permute of Single-Precision Floating-Point Values From Two Tables Overwriting a Table  vpermt2ps9H vpermt2psH vpermt2ps:H vpermt2psH vpermt2ps;H vpermt2psH vpermt2ps9H vpermt2psH vpermt2ps:H vpermt2psH vpermt2ps;H vpermt2psHPhttps://www.felixcloutier.com/x86/vpermt2w:vpermt2d:vpermt2q:vpermt2ps:vpermt2pdPMOVZXDQPMOVZXDQpmovzxdqHMove Packed Doubleword Integers to Quadword Integers with Zero Extensionpmovzxdqpmovzxdq+ VFNMADD231PH VFNMADD231PH vfnmadd231phJFused Negative Multiply-Add of Packed Half-Precision Floating-Point Values vfnmadd231ph<K vfnmadd231phK vfnmadd231ph>K vfnmadd231phK vfnmadd231ph@R vfnmadd231phR vfnmadd231ph<K vfnmadd231phK vfnmadd231ph>K vfnmadd231phK vfnmadd231ph@R vfnmadd231phR vfnmadd231phQR vfnmadd231phQRlhttps://www.felixcloutier.com/x86/vfmadd132ph:vfnmadd132ph:vfmadd213ph:vfnmadd213ph:vfmadd231ph:vfnmadd231phVPRORVQVPRORVQvprorvq%Variable Rotate Packed Quadword Right vprorvq=HvprorvqHvprorvq?HvprorvqHvprorvqAHvprorvqHvprorvq=HvprorvqHvprorvq?HvprorvqHvprorvqAHvprorvqH?https://www.felixcloutier.com/x86/vprord:vprorvd:vprorq:vprorvqUCOMISDUCOMISDucomisdNUnordered Compare Scalar Double-Precision Floating-Point Values and Set EFLAGSucomisdUCOMISDucomisdUCOMISD+)https://www.felixcloutier.com/x86/ucomisdSETZSETZsetzSet byte if zero (ZF == 1)setzSETEQ setzSETEQ#VBROADCASTI32X4VBROADCASTI32X4vbroadcasti32x4"Broadcast Four Doubleword Elementsvbroadcasti32x4/Hvbroadcasti32x4/Hvbroadcasti32x4/Hvbroadcasti32x4/HVPCOMWVPCOMWvpcomw#Compare Packed Signed Word Integersvpcomw"vpcomw/" VSCATTERDPD VSCATTERDPD vscatterdpdTScatter Packed Double-Precision Floating-Point Values with Signed Doubleword Indices vscatterdpdCH vscatterdpdCH vscatterdpdGHQhttps://www.felixcloutier.com/x86/vscatterdps:vscatterdpd:vscatterqps:vscatterqpd VCVTPS2DQ VCVTPS2DQ vcvtps2dqBConvert Packed Single-Precision FP Values to Packed Dword Integers vcvtps2dq9H vcvtps2dq:H vcvtps2dq;H vcvtps2dqH vcvtps2dqH vcvtps2dqH vcvtps2dq9H vcvtps2dq  vcvtps2dqH vcvtps2dq/  vcvtps2dq:H vcvtps2dq  vcvtps2dqH vcvtps2dq2  vcvtps2dq;H vcvtps2dqH vcvtps2dqQH vcvtps2dqQH VRSQRT14SS VRSQRT14SS vrsqrt14ssaCompute Approximate Reciprocal of a Square Root of a Scalar Single-Precision Floating-Point Value vrsqrt14ssH vrsqrt14ss'H vrsqrt14ssH vrsqrt14ss'H,https://www.felixcloutier.com/x86/vrsqrt14ss VFMSUB132PD VFMSUB132PD vfmsub132pdHFused Multiply-Subtract of Packed Double-Precision Floating-Point Values vfmsub132pd=H vfmsub132pdH vfmsub132pd?H vfmsub132pdH vfmsub132pdAH vfmsub132pdH vfmsub132pd=H vfmsub132pd# vfmsub132pdH vfmsub132pd/# vfmsub132pd?H vfmsub132pd# vfmsub132pdH vfmsub132pd2# vfmsub132pdAH vfmsub132pdH vfmsub132pdQH vfmsub132pdQHEhttps://www.felixcloutier.com/x86/vfmsub132pd:vfmsub213pd:vfmsub231pdVADDSDVADDSDvaddsd1Add Scalar Double-Precision Floating-Point ValuesvaddsdHvaddsd+Hvaddsd vaddsdHvaddsd+ vaddsd+HvaddsdQHvaddsdQHVORPSVORPSvorps<Bitwise Logical OR of Single-Precision Floating-Point Valuesvorps9JvorpsJvorps:JvorpsJvorps;JvorpsJvorps9Jvorps vorpsJvorps/ vorps:Jvorps vorpsJvorps2 vorps;JvorpsJSBBSBBsbbSubtract with BorrowsbbbSBBBsbbbSBBB sbbbSBBB  sbbbSBBB #sbbwSBBW sbbwSBBW sbbwSBBW sbbwSBBW  sbbwSBBW $sbblSBBLsbblSBBLsbblSBBLsbblSBBLsbblSBBL'sbbbSBBB#sbbbSBBB# sbbwSBBW$sbbwSBBW$sbbwSBBW$ sbblSBBL'sbblSBBL'sbblSBBL'%https://www.felixcloutier.com/x86/sbbUD2UD2ud2Undefined Instructionud2MOVSSMOVSSmovss2Move Scalar Single-Precision Floating-Point ValuesmovssMOVSSmovssMOVSS'movssMOVSS''https://www.felixcloutier.com/x86/movssVADDPHVADDPHvaddph/Add Packed Half-Precision Floating-Point Valuesvaddph<KvaddphKvaddph>KvaddphKvaddph@RvaddphRvaddph<KvaddphKvaddph>KvaddphKvaddph@RvaddphRvaddphQRvaddphQR(https://www.felixcloutier.com/x86/vaddph VPACKUSWB VPACKUSWB vpackuswb.Pack Words into Bytes with Unsigned Saturation vpackuswbI vpackuswb/I vpackuswbI vpackuswb2I vpackuswbI vpackuswb5I vpackuswb  vpackuswbI vpackuswb/  vpackuswb/I vpackuswb! vpackuswbI vpackuswb2! vpackuswb2I vpackuswbI vpackuswb5IVPRORQVPRORQvprorqRotate Packed Quadword Right vprorq=Hvprorq?HvprorqAHvprorqHvprorqHvprorqHvprorq=HvprorqHvprorq?HvprorqHvprorqAHvprorqH?https://www.felixcloutier.com/x86/vprord:vprorvd:vprorq:vprorvqKORDKORDkordBitwise Logical OR 32-bit MaskskordI5https://www.felixcloutier.com/x86/korw:korb:korq:kord VUNPCKLPS VUNPCKLPS vunpcklpsGUnpack and Interleave Low Packed Single-Precision Floating-Point Values vunpcklps9H vunpcklpsH vunpcklps:H vunpcklpsH vunpcklps;H vunpcklpsH vunpcklps9H vunpcklps  vunpcklpsH vunpcklps/  vunpcklps:H vunpcklps  vunpcklpsH vunpcklps2  vunpcklps;H vunpcklpsH PUNPCKHBW PUNPCKHBW punpckhbw1Unpack and Interleave High-Order Bytes into Words punpckhbw  punpckhbw+  punpckhbw punpckhbw/Jhttps://www.felixcloutier.com/x86/punpckhbw:punpckhwd:punpckhdq:punpckhqdq VCVTPH2DQ VCVTPH2DQ vcvtph2dq@Convert Packed Half-Precision FP Values to Packed Dword Integers vcvtph2dq.K vcvtph2dq<K vcvtph2dq>R vcvtph2dqK vcvtph2dqK vcvtph2dqR vcvtph2dq.K vcvtph2dqK vcvtph2dq<K vcvtph2dqK vcvtph2dq>R vcvtph2dqR vcvtph2dqQR vcvtph2dqQR+https://www.felixcloutier.com/x86/vcvtph2dqVPTESTMDVPTESTMDvptestmd<Logical AND of Packed Doubleword Integer Values and Set Mask vptestmd9Hvptestmd9HvptestmdHvptestmdHvptestmd:Hvptestmd:HvptestmdHvptestmdHvptestmd;Hvptestmd;HvptestmdHvptestmdHEhttps://www.felixcloutier.com/x86/vptestmb:vptestmw:vptestmd:vptestmqVMOVLPDVMOVLPDvmovlpd5Move Low Packed Double-Precision Floating-Point Valuevmovlpd+ vmovlpd+Hvmovlpd+ vmovlpd+H VPMOVUSDB VPMOVUSDB vpmovusdbMDown Convert Packed Doubleword Values to Byte Values with Unsigned Saturation  vpmovusdbH vpmovusdb(H vpmovusdbH vpmovusdb,H vpmovusdbH vpmovusdb0H vpmovusdbH vpmovusdbH vpmovusdbH vpmovusdb'H vpmovusdb+H vpmovusdb/H<https://www.felixcloutier.com/x86/vpmovdb:vpmovsdb:vpmovusdbCMOVBECMOVBEcmovbe+Move if below or equal (CF == 1 or ZF == 1)cmovbew  cmovbew $cmovbelcmovbel' VPMOVZXBD VPMOVZXBD vpmovzxbdDMove Packed Byte Integers to Doubleword Integers with Zero Extension vpmovzxbdH vpmovzxbdH vpmovzxbdH vpmovzxbd'H vpmovzxbd+H vpmovzxbd/H vpmovzxbd  vpmovzxbdH vpmovzxbd'  vpmovzxbd'H vpmovzxbd! vpmovzxbdH vpmovzxbd+! vpmovzxbd+H vpmovzxbdH vpmovzxbd/H VFMADD213SD VFMADD213SD vfmadd213sdCFused Multiply-Add of Scalar Double-Precision Floating-Point Values vfmadd213sdH vfmadd213sd+H vfmadd213sd# vfmadd213sdH vfmadd213sd+# vfmadd213sd+H vfmadd213sdQH vfmadd213sdQHEhttps://www.felixcloutier.com/x86/vfmadd132sd:vfmadd213sd:vfmadd231sd PCMPESTRM PCMPESTRM pcmpestrm3Packed Compare Explicit Length Strings, Return Mask pcmpestrml pcmpestrml/+https://www.felixcloutier.com/x86/pcmpestrmVPSLLDVPSLLDvpslld)Shift Packed Doubleword Data Left Logicalvpslld9Hvpslld:Hvpslld;HvpslldHvpslldHvpslld/HvpslldHvpslldHvpslld/HvpslldHvpslldHvpslld/Hvpslld9Hvpslld vpslldHvpslld vpslldHvpslld/ vpslld/Hvpslld:Hvpslld!vpslldHvpslld!vpslldHvpslld/!vpslld/Hvpslld;HvpslldHvpslldHvpslld/HPFSUBRPFSUBRpfsubr&Packed Floating-Point Subtract Reversepfsubrpfsubr+ VRSQRT28SD VRSQRT28SD vrsqrt28sdApproximation to the Reciprocal Square Root of a Scalar Double-Precision Floating-Point Value with Less Than 2^-28 Relative Error vrsqrt28sdM vrsqrt28sd+M vrsqrt28sdM vrsqrt28sd+M vrsqrt28sdRM vrsqrt28sdRM,https://www.felixcloutier.com/x86/vrsqrt28sd SHA256MSG1 SHA256MSG1 sha256msg1PPerform an Intermediate Calculation for the Next Four SHA256 Message Doublewords sha256msg1( sha256msg1/(,https://www.felixcloutier.com/x86/sha256msg1 VPBROADCASTB VPBROADCASTB vpbroadcastbBroadcast Byte Integer vpbroadcastbI vpbroadcastbI vpbroadcastbI vpbroadcastbI vpbroadcastbI vpbroadcastbI vpbroadcastb#I vpbroadcastb#I vpbroadcastb#I vpbroadcastbI vpbroadcastb! vpbroadcastbI vpbroadcastb#! vpbroadcastb#I vpbroadcastbI vpbroadcastb! vpbroadcastbI vpbroadcastb#! vpbroadcastb#I vpbroadcastbI vpbroadcastbI vpbroadcastb#IUhttps://www.felixcloutier.com/x86/vpbroadcastb:vpbroadcastw:vpbroadcastd:vpbroadcastq VFNMADD213SD VFNMADD213SD vfnmadd213sdLFused Negative Multiply-Add of Scalar Double-Precision Floating-Point Values vfnmadd213sdH vfnmadd213sd+H vfnmadd213sd# vfnmadd213sdH vfnmadd213sd+# vfnmadd213sd+H vfnmadd213sdQH vfnmadd213sdQHHhttps://www.felixcloutier.com/x86/vfnmadd132sd:vfnmadd213sd:vfnmadd231sdPFMINPFMINpfminPacked Floating-Point Minimumpfminpfmin+VRANGESSVRANGESSvrangessYRange Restriction Calculation For a pair of Scalar Single-Precision Floating-Point ValuesvrangessJvrangess'JvrangessJvrangess'JvrangessRJvrangessRJ*https://www.felixcloutier.com/x86/vrangess CVTTSD2SI CVTTSD2SI cvttsd2siJConvert with Truncation Scalar Double-Precision FP Value to Signed Integer cvttsd2si CVTTSD2SL cvttsd2si CVTTSD2SL++https://www.felixcloutier.com/x86/cvttsd2si VCVTTPS2UQQ VCVTTPS2UQQ vcvttps2uqqpConvert with Truncation Packed Single Precision Floating-Point Values to Packed Unsigned Quadword Integer Values vcvttps2uqq8J vcvttps2uqq9J vcvttps2uqq:J vcvttps2uqqJ vcvttps2uqqJ vcvttps2uqqJ vcvttps2uqq8J vcvttps2uqqJ vcvttps2uqq9J vcvttps2uqqJ vcvttps2uqq:J vcvttps2uqqJ vcvttps2uqqRJ vcvttps2uqqRJ-https://www.felixcloutier.com/x86/vcvttps2uqq VPDPBSUDS VPDPBSUDS vpdpbsudsXPacked Dot Product of Signed-by-Unsinged Byte subvectors into Doubleword with Saturation vpdpbsudsX vpdpbsuds/X vpdpbsudsX vpdpbsuds2XADOXADOXadox<Unsigned Integer Addition of Two Operands with Overflow Flagadoxl7adoxl'7&https://www.felixcloutier.com/x86/adoxBLCFILLBLCFILLblcfillFill From Lowest Clear Bitblcfill6blcfill'6BLSFILLBLSFILLblsfillFill From Lowest Set Bitblsfill6blsfill'6PREFETCHPREFETCHprefetchPrefetch Data into Cachesprefetch#@RSQRTSSRSQRTSSrsqrtssQCompute Reciprocal of Square Root of Scalar Single-Precision Floating-Point ValuersqrtssRSQRTSSrsqrtssRSQRTSS')https://www.felixcloutier.com/x86/rsqrtssSETNBSETNBsetnbSet byte if not below (CF == 0)setnbSETCC setnbSETCC#VALIGNDVALIGNDvaligndAlign Doubleword Vectors valignd9HvaligndHvalignd:HvaligndHvalignd;HvaligndHvalignd9HvaligndHvalignd:HvaligndHvalignd;HvaligndH1https://www.felixcloutier.com/x86/valignd:valignqDIVPDDIVPDdivpd4Divide Packed Double-Precision Floating-Point ValuesdivpdDIVPDdivpdDIVPD/'https://www.felixcloutier.com/x86/divpd VBROADCASTSD VBROADCASTSD vbroadcastsd1Broadcast Double-Precision Floating-Point Element  vbroadcastsdH vbroadcastsdH vbroadcastsd+H vbroadcastsd+H vbroadcastsd! vbroadcastsdH vbroadcastsd+  vbroadcastsd+H vbroadcastsdH vbroadcastsd+HVFMSUBSSVFMSUBSSvfmsubssHFused Multiply-Subtract of Scalar Single-Precision Floating-Point Valuesvfmsubss$vfmsubss'$vfmsubss'$VPBLENDWVPBLENDWvpblendwBlend Packed Wordsvpblendw vpblendw/ vpblendw!vpblendw2!VPMAXSDVPMAXSDvpmaxsd,Maximum of Packed Signed Doubleword Integersvpmaxsd9HvpmaxsdHvpmaxsd:HvpmaxsdHvpmaxsd;HvpmaxsdHvpmaxsd9Hvpmaxsd vpmaxsdHvpmaxsd/ vpmaxsd:Hvpmaxsd!vpmaxsdHvpmaxsd2!vpmaxsd;HvpmaxsdH VPSCATTERDD VPSCATTERDD vpscatterdd?Scatter Packed Doubleword Values with Signed Doubleword Indices vpscatterddCH vpscatterddGH vpscatterddKHQhttps://www.felixcloutier.com/x86/vpscatterdd:vpscatterdq:vpscatterqd:vpscatterqqKORTESTDKORTESTDkortestdOR 32-bit Masks and Set FlagskortestdIEhttps://www.felixcloutier.com/x86/kortestw:kortestb:kortestq:kortestdVPSUBBVPSUBBvpsubbSubtract Packed Byte IntegersvpsubbIvpsubb/IvpsubbIvpsubb2IvpsubbIvpsubb5Ivpsubb vpsubbIvpsubb/ vpsubb/Ivpsubb!vpsubbIvpsubb2!vpsubb2IvpsubbIvpsubb5ICBWCBWcbwConvert Byte to Wordcbtw/https://www.felixcloutier.com/x86/cbw:cwde:cdqeCMOVNZCMOVNZcmovnzMove if not zero (ZF == 0)cmovnzw  cmovnzw $cmovnzlcmovnzl' VEXTRACTI64X4 VEXTRACTI64X4 vextracti64x42Extract 256 Bits of Packed Quadword Integer Values vextracti64x4H vextracti64x43H vextracti64x4H vextracti64x42HVPXORDVPXORDvpxord:Bitwise Logical Exclusive OR of Packed Doubleword Integers vpxord9HvpxordHvpxord:HvpxordHvpxord;HvpxordHvpxord9HvpxordHvpxord:HvpxordHvpxord;HvpxordHVRCP28PSVRCP28PSvrcp28pstApproximation to the Reciprocal of Packed Single-Precision Floating-Point Values with Less Than 2^-28 Relative Errorvrcp28ps;Mvrcp28psMvrcp28ps;Mvrcp28psMvrcp28psRMvrcp28psRM*https://www.felixcloutier.com/x86/vrcp28psPMULLWPMULLWpmullw9Multiply Packed Signed Word Integers and Store Low Resultpmullw pmullw+ pmullwpmullw/(https://www.felixcloutier.com/x86/pmullw VFNMADDSS VFNMADDSS vfnmaddssLFused Negative Multiply-Add of Scalar Single-Precision Floating-Point Values vfnmaddss$ vfnmaddss'$ vfnmaddss'$ PCMPISTRI PCMPISTRI pcmpistri4Packed Compare Implicit Length Strings, Return Index pcmpistri pcmpistri/+https://www.felixcloutier.com/x86/pcmpistriMOVUPDMOVUPDmovupd<Move Unaligned Packed Double-Precision Floating-Point ValuesmovupdMOVUPDmovupdMOVUPD/movupdMOVUPD/(https://www.felixcloutier.com/x86/movupdVPANDNQVPANDNQvpandnq3Bitwise Logical AND NOT of Packed Quadword Integers vpandnq=HvpandnqHvpandnq?HvpandnqHvpandnqAHvpandnqHvpandnq=HvpandnqHvpandnq?HvpandnqHvpandnqAHvpandnqH VRSQRT14PS VRSQRT14PS vrsqrt14ps`Compute Approximate Reciprocals of Square Roots of Packed Single-Precision Floating-Point Values  vrsqrt14ps9H vrsqrt14ps:H vrsqrt14ps;H vrsqrt14psH vrsqrt14psH vrsqrt14psH vrsqrt14ps9H vrsqrt14psH vrsqrt14ps:H vrsqrt14psH vrsqrt14ps;H vrsqrt14psH,https://www.felixcloutier.com/x86/vrsqrt14psAESENCAESENCaesenc+Perform One Round of an AES Encryption FlowaesencAESENC'aesencAESENC/'(https://www.felixcloutier.com/x86/aesenc AESENCLAST AESENCLAST aesenclast,Perform Last Round of an AES Encryption Flow aesenclast' aesenclast/',https://www.felixcloutier.com/x86/aesenclastVSUBPDVSUBPDvsubpd6Subtract Packed Double-Precision Floating-Point Valuesvsubpd=HvsubpdHvsubpd?HvsubpdHvsubpdAHvsubpdHvsubpd=Hvsubpd vsubpdHvsubpd/ vsubpd?Hvsubpd vsubpdHvsubpd2 vsubpdAHvsubpdHvsubpdQHvsubpdQH VCVTSI2SD VCVTSI2SD vcvtsi2sd9Convert Dword Integer to Scalar Double-Precision FP Value vcvtsi2sdl  vcvtsi2sdlH vcvtsi2sdl'  vcvtsi2sdl'HPSRLDPSRLDpsrld*Shift Packed Doubleword Data Right Logicalpsrld psrld psrld+ psrldpsrldpsrld/3https://www.felixcloutier.com/x86/psrlw:psrld:psrlq VPUNPCKHBW VPUNPCKHBW vpunpckhbw1Unpack and Interleave High-Order Bytes into Words vpunpckhbwI vpunpckhbw/I vpunpckhbwI vpunpckhbw2I vpunpckhbwI vpunpckhbw5I vpunpckhbw  vpunpckhbwI vpunpckhbw/  vpunpckhbw/I vpunpckhbw! vpunpckhbwI vpunpckhbw2! vpunpckhbw2I vpunpckhbwI vpunpckhbw5I VPSCATTERDQ VPSCATTERDQ vpscatterdq=Scatter Packed Quadword Values with Signed Doubleword Indices vpscatterdqCH vpscatterdqCH vpscatterdqGHQhttps://www.felixcloutier.com/x86/vpscatterdd:vpscatterdq:vpscatterqd:vpscatterqq VCVTPH2PD VCVTPH2PD vcvtph2pdLConvert Packed Half-Precision FP Values to Packed Double-Precision FP Values vcvtph2pd*K vcvtph2pd.K vcvtph2pd<R vcvtph2pdK vcvtph2pdK vcvtph2pdR vcvtph2pd*K vcvtph2pdK vcvtph2pd.K vcvtph2pdK vcvtph2pd<R vcvtph2pdR vcvtph2pdRR vcvtph2pdRR+https://www.felixcloutier.com/x86/vcvtph2pd VDPBF16PS VDPBF16PS vdpbf16psLPacked Dot Product of BFloat16 FP subvectors into Single-Precision FP values  vdpbf16ps9K vdpbf16psK vdpbf16ps:K vdpbf16psK vdpbf16ps;Q vdpbf16psQ vdpbf16ps9K vdpbf16psK vdpbf16ps:K vdpbf16psK vdpbf16ps;Q vdpbf16psQ+https://www.felixcloutier.com/x86/vdpbf16psROUNDPSROUNDPSroundps3Round Packed Single Precision Floating-Point Valuesroundpsroundps/)https://www.felixcloutier.com/x86/roundpsVPMULTISHIFTQBVPMULTISHIFTQBvpmultishiftqb3Select Packed Unaligned Bytes from Quadword Sources vpmultishiftqb=KvpmultishiftqbKvpmultishiftqb?KvpmultishiftqbKvpmultishiftqbATvpmultishiftqbTvpmultishiftqb=KvpmultishiftqbKvpmultishiftqb?KvpmultishiftqbKvpmultishiftqbATvpmultishiftqbT0https://www.felixcloutier.com/x86/vpmultishiftqbVPSHLDVPSHLDvpshld Packed Shift Logical Doublewordsvpshld"vpshld/"vpshld/"(https://www.felixcloutier.com/x86/vpshldVBROADCASTF32X2VBROADCASTF32X2vbroadcastf32x26Broadcast Two Single-Precision Floating-Point Elementsvbroadcastf32x2Jvbroadcastf32x2Jvbroadcastf32x2+Jvbroadcastf32x2+Jvbroadcastf32x2Jvbroadcastf32x2+Jvbroadcastf32x2Jvbroadcastf32x2+JSETNZSETNZsetnzSet byte if not zero (ZF == 0)setnzSETNE setnzSETNE#VADDSHVADDSHvaddsh/Add Scalar Half-Precision Floating-Point ValuesvaddshRvaddsh$RvaddshRvaddsh$RvaddshQRvaddshQR(https://www.felixcloutier.com/x86/vaddshVALIGNQVALIGNQvalignqAlign Quadword Vectors valignq=HvalignqHvalignq?HvalignqHvalignqAHvalignqHvalignq=HvalignqHvalignq?HvalignqHvalignqAHvalignqH1https://www.felixcloutier.com/x86/valignd:valignqPOPCNTPOPCNTpopcnt Count of Number of Bits Set to 1popcntw  2popcntw $2popcntl2popcntl'2(https://www.felixcloutier.com/x86/popcntPMOVSXBWPMOVSXBWpmovsxbw>Move Packed Byte Integers to Word Integers with Sign Extensionpmovsxbwpmovsxbw+ VFMADD132PD VFMADD132PD vfmadd132pdCFused Multiply-Add of Packed Double-Precision Floating-Point Values vfmadd132pd=H vfmadd132pdH vfmadd132pd?H vfmadd132pdH vfmadd132pdAH vfmadd132pdH vfmadd132pd=H vfmadd132pd# vfmadd132pdH vfmadd132pd/# vfmadd132pd?H vfmadd132pd# vfmadd132pdH vfmadd132pd2# vfmadd132pdAH vfmadd132pdH vfmadd132pdQH vfmadd132pdQHEhttps://www.felixcloutier.com/x86/vfmadd132pd:vfmadd213pd:vfmadd231pdVMAXSDVMAXSDvmaxsd;Return Maximum Scalar Double-Precision Floating-Point ValuevmaxsdHvmaxsd+Hvmaxsd vmaxsdHvmaxsd+ vmaxsd+HvmaxsdRHvmaxsdRHPMAXSBPMAXSBpmaxsb&Maximum of Packed Signed Byte Integerspmaxsbpmaxsb/=https://www.felixcloutier.com/x86/pmaxsb:pmaxsw:pmaxsd:pmaxsqSUBSUBsubSubtractsubbSUBBsubbSUBB subbSUBB  subbSUBB #subwSUBW subwSUBW subwSUBW subwSUBW  subwSUBW $sublSUBLsublSUBLsublSUBLsublSUBLsublSUBL'subbSUBB#subbSUBB# subwSUBW$subwSUBW$subwSUBW$ sublSUBL'sublSUBL'sublSUBL'%https://www.felixcloutier.com/x86/subPALIGNRPALIGNRpalignrPacked Align Rightpalignrpalignr+palignrpalignr/)https://www.felixcloutier.com/x86/palignrSHLDSHLDshld#Integer Double Precision Shift Leftshldw  shldw  shldlshldlshldw$ shldw$ shldl'shldl'&https://www.felixcloutier.com/x86/shldVPCOMUDVPCOMUDvpcomud+Compare Packed Unsigned Doubleword Integersvpcomud"vpcomud/" VPMOVSXWQ VPMOVSXWQ vpmovsxwqBMove Packed Word Integers to Quadword Integers with Sign Extension vpmovsxwqH vpmovsxwqH vpmovsxwqH vpmovsxwq'H vpmovsxwq+H vpmovsxwq/H vpmovsxwq  vpmovsxwqH vpmovsxwq'  vpmovsxwq'H vpmovsxwq! vpmovsxwqH vpmovsxwq+! vpmovsxwq+H vpmovsxwqH vpmovsxwq/HVPBROADCASTMW2DVPBROADCASTMW2Dvpbroadcastmw2d?Broadcast Low Word of Mask Register to Packed Doubleword Valuesvpbroadcastmw2dNvpbroadcastmw2dNvpbroadcastmw2dN VSCATTERQPS VSCATTERQPS vscatterqpsRScatter Packed Single-Precision Floating-Point Values with Signed Quadword Indices vscatterqpsEH vscatterqpsIH vscatterqpsMHQhttps://www.felixcloutier.com/x86/vscatterdps:vscatterdpd:vscatterqps:vscatterqpdVBROADCASTI64X4VBROADCASTI64X4vbroadcasti64x4 Broadcast Four Quadword Elementsvbroadcasti64x42Hvbroadcasti64x42HPCMPEQDPCMPEQDpcmpeqd+Compare Packed Doubleword Data for Equalitypcmpeqd pcmpeqd+ pcmpeqdpcmpeqd/9https://www.felixcloutier.com/x86/pcmpeqb:pcmpeqw:pcmpeqdVPMINUWVPMINUWvpminuw(Minimum of Packed Unsigned Word IntegersvpminuwIvpminuw/IvpminuwIvpminuw2IvpminuwIvpminuw5Ivpminuw vpminuwIvpminuw/ vpminuw/Ivpminuw!vpminuwIvpminuw2!vpminuw2IvpminuwIvpminuw5IVMPSADBWVMPSADBWvmpsadbw3Compute Multiple Packed Sums of Absolute Differencevmpsadbw vmpsadbw/ vmpsadbw!vmpsadbw2!MWAITXMWAITXmwaitxMonitor Wait with TimeoutmwaitxEVFMULCPHVFMULCPHvfmulcphKFused Fused Multiply of Complex Packed Half-Precision Floating-Point Valuesvfmulcph9KvfmulcphKvfmulcph:KvfmulcphKvfmulcph;RvfmulcphRvfmulcph9KvfmulcphKvfmulcph:KvfmulcphKvfmulcph;RvfmulcphRvfmulcphQRvfmulcphQR4https://www.felixcloutier.com/x86/vfcmulcph:vfmulcphVCOMISDVCOMISDvcomisdLCompare Scalar Ordered Double-Precision Floating-Point Values and Set EFLAGSvcomisd vcomisdHvcomisd+ vcomisd+HvcomisdRHVPROLVDVPROLVDvprolvd&Variable Rotate Packed Doubleword Left vprolvd9HvprolvdHvprolvd:HvprolvdHvprolvd;HvprolvdHvprolvd9HvprolvdHvprolvd:HvprolvdHvprolvd;HvprolvdH?https://www.felixcloutier.com/x86/vprold:vprolvd:vprolq:vprolvqCMPSSCMPSScmpss5Compare Scalar Single-Precision Floating-Point ValuescmpssCMPSScmpssCMPSS''https://www.felixcloutier.com/x86/cmpssMOVNTQMOVNTQmovntq)Store of Quadword Using Non-Temporal Hintmovntq+ (https://www.felixcloutier.com/x86/movntqROUNDSSROUNDSSroundss3Round Scalar Single Precision Floating-Point Valuesroundssroundss')https://www.felixcloutier.com/x86/roundssVPSHLBVPSHLBvpshlbPacked Shift Logical Bytesvpshlb"vpshlb/"vpshlb/"VPCMPUDVPCMPUDvpcmpud)Compare Packed Unsigned Doubleword Values vpcmpud9Hvpcmpud9HvpcmpudHvpcmpudHvpcmpud:Hvpcmpud:HvpcmpudHvpcmpudHvpcmpud;Hvpcmpud;HvpcmpudHvpcmpudH0https://www.felixcloutier.com/x86/vpcmpd:vpcmpudMOVSHDUPMOVSHDUPmovshdup(Move Packed Single-FP High and Duplicatemovshdupmovshdup/*https://www.felixcloutier.com/x86/movshdup VFMSUB231SS VFMSUB231SS vfmsub231ssHFused Multiply-Subtract of Scalar Single-Precision Floating-Point Values vfmsub231ssH vfmsub231ss'H vfmsub231ss# vfmsub231ssH vfmsub231ss'# vfmsub231ss'H vfmsub231ssQH vfmsub231ssQHEhttps://www.felixcloutier.com/x86/vfmsub132ss:vfmsub213ss:vfmsub231ssMOVAPDMOVAPDmovapd:Move Aligned Packed Double-Precision Floating-Point ValuesmovapdMOVAPDmovapdMOVAPD/movapdMOVAPD/(https://www.felixcloutier.com/x86/movapdKXNORWKXNORWkxnorw!Bitwise Logical XNOR 16-bit MaskskxnorwH=https://www.felixcloutier.com/x86/kxnorw:kxnorb:kxnorq:kxnord VPSCATTERQQ VPSCATTERQQ vpscatterqq;Scatter Packed Quadword Values with Signed Quadword Indices vpscatterqqEH vpscatterqqIH vpscatterqqMHQhttps://www.felixcloutier.com/x86/vpscatterdd:vpscatterdq:vpscatterqd:vpscatterqq CVTTPS2PI CVTTPS2PI cvttps2piRConvert with Truncation Packed Single-Precision FP Values to Packed Dword Integers cvttps2pi CVTTPS2PL cvttps2pi CVTTPS2PL++https://www.felixcloutier.com/x86/cvttps2piPHADDDPHADDDphaddd(Packed Horizontal Add Doubleword Integerphadddphaddd+phadddphaddd//https://www.felixcloutier.com/x86/phaddw:phadddVBROADCASTI32X2VBROADCASTI32X2vbroadcasti32x2!Broadcast Two Doubleword Elements vbroadcasti32x2Jvbroadcasti32x2Jvbroadcasti32x2Jvbroadcasti32x2+Jvbroadcasti32x2+Jvbroadcasti32x2+Jvbroadcasti32x2Jvbroadcasti32x2+Jvbroadcasti32x2Jvbroadcasti32x2+Jvbroadcasti32x2Jvbroadcasti32x2+J VFMADD132PH VFMADD132PH vfmadd132phAFused Multiply-Add of Packed Half-Precision Floating-Point Values vfmadd132ph<K vfmadd132phK vfmadd132ph>K vfmadd132phK vfmadd132ph@R vfmadd132phR vfmadd132ph<K vfmadd132phK vfmadd132ph>K vfmadd132phK vfmadd132ph@R vfmadd132phR vfmadd132phQR vfmadd132phQRlhttps://www.felixcloutier.com/x86/vfmadd132ph:vfnmadd132ph:vfmadd213ph:vfnmadd213ph:vfmadd231ph:vfnmadd231phVMOVLHPSVMOVLHPSvmovlhps>Move Packed Single-Precision Floating-Point Values Low to Highvmovlhps vmovlhpsH VPMACSSWD VPMACSSWD vpmacsswdKPacked Multiply Accumulate with Saturation Signed Word to Signed Doubleword vpmacsswd" vpmacsswd/"BLSRBLSRblsrReset Lowest Set Bitblsrl4blsrl'4&https://www.felixcloutier.com/x86/blsrVMINSDVMINSDvminsd;Return Minimum Scalar Double-Precision Floating-Point ValuevminsdHvminsd+Hvminsd vminsdHvminsd+ vminsd+HvminsdRHvminsdRH VPMADCSSWD VPMADCSSWD vpmadcsswdOPacked Multiply Add Accumulate with Saturation Signed Word to Signed Doubleword vpmadcsswd" vpmadcsswd/"VRCP28SSVRCP28SSvrcp28ssuApproximation to the Reciprocal of a Scalar Single-Precision Floating-Point Value with Less Than 2^-28 Relative Errorvrcp28ssMvrcp28ss'Mvrcp28ssMvrcp28ss'Mvrcp28ssRMvrcp28ssRM*https://www.felixcloutier.com/x86/vrcp28ssPSUBDPSUBDpsubd#Subtract Packed Doubleword IntegerspsubdPSUBL psubdPSUBL+ psubdPSUBLpsubdPSUBL/3https://www.felixcloutier.com/x86/psubb:psubw:psubdPSHUFBPSHUFBpshufbPacked Shuffle BytespshufbPSHUFBpshufbPSHUFB+pshufbPSHUFBpshufbPSHUFB/(https://www.felixcloutier.com/x86/pshufb VPBROADCASTW VPBROADCASTW vpbroadcastwBroadcast Word Integer vpbroadcastwI vpbroadcastwI vpbroadcastwI vpbroadcastwI vpbroadcastwI vpbroadcastwI vpbroadcastw$I vpbroadcastw$I vpbroadcastw$I vpbroadcastwI vpbroadcastw! vpbroadcastwI vpbroadcastw$! vpbroadcastw$I vpbroadcastwI vpbroadcastw! vpbroadcastwI vpbroadcastw$! vpbroadcastw$I vpbroadcastwI vpbroadcastwI vpbroadcastw$IUhttps://www.felixcloutier.com/x86/vpbroadcastb:vpbroadcastw:vpbroadcastd:vpbroadcastqHSUBPSHSUBPShsubps$Packed Single-FP Horizontal Subtracthsubpshsubps/(https://www.felixcloutier.com/x86/hsubps VFMADD213PS VFMADD213PS vfmadd213psCFused Multiply-Add of Packed Single-Precision Floating-Point Values vfmadd213ps9H vfmadd213psH vfmadd213ps:H vfmadd213psH vfmadd213ps;H vfmadd213psH vfmadd213ps9H vfmadd213ps# vfmadd213psH vfmadd213ps/# vfmadd213ps:H vfmadd213ps# vfmadd213psH vfmadd213ps2# vfmadd213ps;H vfmadd213psH vfmadd213psQH vfmadd213psQHEhttps://www.felixcloutier.com/x86/vfmadd132ps:vfmadd213ps:vfmadd231ps VFMSUB213PD VFMSUB213PD vfmsub213pdHFused Multiply-Subtract of Packed Double-Precision Floating-Point Values vfmsub213pd=H vfmsub213pdH vfmsub213pd?H vfmsub213pdH vfmsub213pdAH vfmsub213pdH vfmsub213pd=H vfmsub213pd# vfmsub213pdH vfmsub213pd/# vfmsub213pd?H vfmsub213pd# vfmsub213pdH vfmsub213pd2# vfmsub213pdAH vfmsub213pdH vfmsub213pdQH vfmsub213pdQHEhttps://www.felixcloutier.com/x86/vfmsub132pd:vfmsub213pd:vfmsub231pdCMOVLECMOVLEcmovle+Move if less or equal (ZF == 1 or SF != OF)cmovlew  cmovlew $cmovlelcmovlel' VMOVDQU16 VMOVDQU16 vmovdqu16Move Unaligned Word Values vmovdqu160I vmovdqu16I vmovdqu163I vmovdqu16I vmovdqu166I vmovdqu16I vmovdqu16/I vmovdqu162I vmovdqu165I vmovdqu16I vmovdqu16/I vmovdqu16I vmovdqu162I vmovdqu16I vmovdqu165I vmovdqu16/I vmovdqu162I vmovdqu165IOhttps://www.felixcloutier.com/x86/movdqu:vmovdqu8:vmovdqu16:vmovdqu32:vmovdqu64VCVTNEOBF162PSVCVTNEOBF162PSvcvtneobf162ps9Convert Odd Elements of Packed BF16 Values to FP32 Valuesvcvtneobf162ps/Zvcvtneobf162ps2Z VCVTUQQ2PH VCVTUQQ2PH vcvtuqq2phXConvert Packed Unsigned Quadword Integers to Packed Half-Precision Floating-Point Values vcvtuqq2phx=K vcvtuqq2phy?K vcvtuqq2phzAR vcvtuqq2phxK vcvtuqq2phyK vcvtuqq2phzR vcvtuqq2phx=K vcvtuqq2phy?K vcvtuqq2phzAR vcvtuqq2phxK vcvtuqq2phyK vcvtuqq2phzR vcvtuqq2phzQR vcvtuqq2phzQR,https://www.felixcloutier.com/x86/vcvtuqq2phVPHADDBQVPHADDBQvphaddbq4Packed Horizontal Add Signed Byte to Signed Quadwordvphaddbq"vphaddbq/"SHRDSHRDshrd$Integer Double Precision Shift Rightshrdw  shrdw  shrdlshrdlshrdw$ shrdw$ shrdl'shrdl'&https://www.felixcloutier.com/x86/shrd VGETEXPSH VGETEXPSH vgetexpsheExtract Exponent of Scalar Half-Precision Floating-Point Value as Half-Precision Floating-Point Value vgetexpshR vgetexpsh$R vgetexpshR vgetexpsh$R vgetexpshRR vgetexpshRR+https://www.felixcloutier.com/x86/vgetexpshVPEXTRWVPEXTRWvpextrw Extract Wordvpextrw vpextrwIvpextrw$ vpextrw$IVPCOMUBVPCOMUBvpcomub%Compare Packed Unsigned Byte Integersvpcomub"vpcomub/" VFNMSUB213PD VFNMSUB213PD vfnmsub213pdQFused Negative Multiply-Subtract of Packed Double-Precision Floating-Point Values vfnmsub213pd=H vfnmsub213pdH vfnmsub213pd?H vfnmsub213pdH vfnmsub213pdAH vfnmsub213pdH vfnmsub213pd=H vfnmsub213pd# vfnmsub213pdH vfnmsub213pd/# vfnmsub213pd?H vfnmsub213pd# vfnmsub213pdH vfnmsub213pd2# vfnmsub213pdAH vfnmsub213pdH vfnmsub213pdQH vfnmsub213pdQHHhttps://www.felixcloutier.com/x86/vfnmsub132pd:vfnmsub213pd:vfnmsub231pdVPSHRDDVPSHRDDvpshrdd:Concatenate and Shift Packed Doubleword Data Right Logical vpshrdd9KvpshrddKvpshrdd:KvpshrddKvpshrdd;UvpshrddUvpshrdd9KvpshrddKvpshrdd:KvpshrddKvpshrdd;UvpshrddUVPSUBUSWVPSUBUSWvpsubusw?Subtract Packed Unsigned Word Integers with Unsigned SaturationvpsubuswIvpsubusw/IvpsubuswIvpsubusw2IvpsubuswIvpsubusw5Ivpsubusw vpsubuswIvpsubusw/ vpsubusw/Ivpsubusw!vpsubuswIvpsubusw2!vpsubusw2IvpsubuswIvpsubusw5IADDADDaddAddaddbADDBaddbADDB addbADDB  addbADDB #addwADDW addwADDW addwADDW addwADDW  addwADDW $addlADDLaddlADDLaddlADDLaddlADDLaddlADDL'addbADDB#addbADDB# addwADDW$addwADDW$addwADDW$ addlADDL'addlADDL'addlADDL'%https://www.felixcloutier.com/x86/addVCMPSHVCMPSHvcmpsh3Compare Scalar Half-Precision Floating-Point ValuesvcmpshRvcmpshRvcmpsh$Rvcmpsh$RvcmpshRRvcmpshRR(https://www.felixcloutier.com/x86/vcmpsh VSHUFF64X2 VSHUFF64X2 vshuff64x2=Shuffle 128-Bit Packed Double-Precision Floating-Point Values vshuff64x2?H vshuff64x2H vshuff64x2AH vshuff64x2H vshuff64x2?H vshuff64x2H vshuff64x2AH vshuff64x2H VFMADD231SS VFMADD231SS vfmadd231ssCFused Multiply-Add of Scalar Single-Precision Floating-Point Values vfmadd231ssH vfmadd231ss'H vfmadd231ss# vfmadd231ssH vfmadd231ss'# vfmadd231ss'H vfmadd231ssQH vfmadd231ssQHEhttps://www.felixcloutier.com/x86/vfmadd132ss:vfmadd213ss:vfmadd231ssROLROLrol Rotate LeftrolbROLB rolbROLB rolbROLB rolwROLW rolwROLW rolwROLW rollROLLrollROLLrollROLLrolbROLB#rolbROLB#rolbROLB#rolwROLW$rolwROLW$rolwROLW$rollROLL'rollROLL'rollROLL'1https://www.felixcloutier.com/x86/rcl:rcr:rol:rorPFRCPVPFRCPVpfrcpv5Packed Floating-Point Reciprocal Approximation Vectorpfrcpvpfrcpv+CVTPI2PSCVTPI2PScvtpi2psBConvert Packed Dword Integers to Packed Single-Precision FP Valuescvtpi2psCVTPL2PScvtpi2psCVTPL2PS+*https://www.felixcloutier.com/x86/cvtpi2ps VFPCLASSPD VFPCLASSPD vfpclasspd;Test Class of Packed Double-Precision Floating-Point Values  vfpclasspdx=J vfpclasspdx=J vfpclasspdy?J vfpclasspdy?J vfpclasspdzAJ vfpclasspdzAJ vfpclasspdxJ vfpclasspdxJ vfpclasspdyJ vfpclasspdyJ vfpclasspdzJ vfpclasspdzJ,https://www.felixcloutier.com/x86/vfpclasspd VPDPBSSDS VPDPBSSDS vpdpbssdsVPacked Dot Product of Signed-by-Singed Byte subvectors into Doubleword with Saturation vpdpbssdsX vpdpbssds/X vpdpbssdsX vpdpbssds2XVSUBPSVSUBPSvsubps6Subtract Packed Single-Precision Floating-Point Valuesvsubps9HvsubpsHvsubps:HvsubpsHvsubps;HvsubpsHvsubps9Hvsubps vsubpsHvsubps/ vsubps:Hvsubps vsubpsHvsubps2 vsubps;HvsubpsHvsubpsQHvsubpsQH CLFLUSHOPT CLFLUSHOPT clflushoptFlush Cache Line Optimized clflushopt#:,https://www.felixcloutier.com/x86/clflushoptVPSHRDWVPSHRDWvpshrdw4Concatenate and Shift Packed Word Data Right Logical vpshrdwKvpshrdw/KvpshrdwKvpshrdw2KvpshrdwUvpshrdw5UvpshrdwKvpshrdw/KvpshrdwKvpshrdw2KvpshrdwUvpshrdw5UANDANDand Logical ANDandbANDBandbANDB andbANDB  andbANDB #andwANDW andwANDW andwANDW andwANDW  andwANDW $andlANDLandlANDLandlANDLandlANDLandlANDL'andbANDB#andbANDB# andwANDW$andwANDW$andwANDW$ andlANDL'andlANDL'andlANDL'%https://www.felixcloutier.com/x86/andPANDNPANDNpandnPacked Bitwise Logical AND NOTpandn pandn+ pandnpandn/'https://www.felixcloutier.com/x86/pandnVRANGEPSVRANGEPSvrangepsXRange Restriction Calculation For Packed Pairs of Single-Precision Floating-Point Valuesvrangeps9JvrangepsJvrangeps:JvrangepsJvrangeps;JvrangepsJvrangeps9JvrangepsJvrangeps:JvrangepsJvrangeps;JvrangepsJvrangepsRJvrangepsRJ*https://www.felixcloutier.com/x86/vrangepsPADDSBPADDSBpaddsb6Add Packed Signed Byte Integers with Signed Saturationpaddsb paddsb+ paddsbpaddsb//https://www.felixcloutier.com/x86/paddsb:paddswSETNESETNEsetneSet byte if not equal (ZF == 0)setneSETNE setneSETNE# VPCMPESTRI VPCMPESTRI vpcmpestri4Packed Compare Explicit Length Strings, Return Index vpcmpestril  vpcmpestril/ PSHUFLWPSHUFLWpshuflwShuffle Packed Low WordspshuflwPSHUFLWpshuflwPSHUFLW/)https://www.felixcloutier.com/x86/pshuflw VCVTSS2SI VCVTSS2SI vcvtss2si9Convert Scalar Single-Precision FP Value to Dword Integer vcvtss2si  vcvtss2siH vcvtss2si'  vcvtss2si'H vcvtss2siQH VPDPBUUDS VPDPBUUDS vpdpbuudsZPacked Dot Product of Unsigned-by-Unsinged Byte subvectors into Doubleword with Saturation vpdpbuudsX vpdpbuuds/X vpdpbuudsX vpdpbuuds2XVPHSUBBWVPHSUBBWvphsubbw5Packed Horizontal Subtract Signed Byte to Signed Wordvphsubbw"vphsubbw/"VSQRTPSVSQRTPSvsqrtpsECompute Square Roots of Packed Single-Precision Floating-Point Valuesvsqrtps9Hvsqrtps:Hvsqrtps;HvsqrtpsHvsqrtpsHvsqrtpsHvsqrtps9Hvsqrtps vsqrtpsHvsqrtps/ vsqrtps:Hvsqrtps vsqrtpsHvsqrtps2 vsqrtps;HvsqrtpsHvsqrtpsQHvsqrtpsQH VPBLENDMB VPBLENDMB vpblendmb*Blend Byte Vectors Using an OpMask Control  vpblendmbI vpblendmb/I vpblendmbI vpblendmb2I vpblendmbI vpblendmb5I vpblendmbI vpblendmb/I vpblendmbI vpblendmb2I vpblendmbI vpblendmb5I5https://www.felixcloutier.com/x86/vpblendmb:vpblendmwVPMULLQVPMULLQvpmullq=Multiply Packed Signed Quadword Integers and Store Low Result vpmullq=JvpmullqJvpmullq?JvpmullqJvpmullqAJvpmullqJvpmullq=JvpmullqJvpmullq?JvpmullqJvpmullqAJvpmullqJJPOJPOjpoJump if parity odd (PF == 0)jpoJPCNjpoJPCOMOVNTPDMOVNTPDmovntpdKStore Packed Double-Precision Floating-Point Values Using Non-Temporal HintmovntpdMOVNTPD/)https://www.felixcloutier.com/x86/movntpdVFMSUBADD213PSVFMSUBADD213PSvfmsubadd213psXFused Multiply-Alternating Subtract/Add of Packed Single-Precision Floating-Point Valuesvfmsubadd213ps9Hvfmsubadd213psHvfmsubadd213ps:Hvfmsubadd213psHvfmsubadd213ps;Hvfmsubadd213psHvfmsubadd213ps9Hvfmsubadd213ps#vfmsubadd213psHvfmsubadd213ps/#vfmsubadd213ps:Hvfmsubadd213ps#vfmsubadd213psHvfmsubadd213ps2#vfmsubadd213ps;Hvfmsubadd213psHvfmsubadd213psQHvfmsubadd213psQHNhttps://www.felixcloutier.com/x86/vfmsubadd132ps:vfmsubadd213ps:vfmsubadd231ps VRSQRT28SS VRSQRT28SS vrsqrt28ssApproximation to the Reciprocal Square Root of a Scalar Single-Precision Floating-Point Value with Less Than 2^-28 Relative Error vrsqrt28ssM vrsqrt28ss'M vrsqrt28ssM vrsqrt28ss'M vrsqrt28ssRM vrsqrt28ssRM,https://www.felixcloutier.com/x86/vrsqrt28ssPMULHUWPMULHUWpmulhuw<Multiply Packed Unsigned Word Integers and Store High Resultpmulhuw pmulhuw+ pmulhuwpmulhuw/)https://www.felixcloutier.com/x86/pmulhuwVPSUBQVPSUBQvpsubq!Subtract Packed Quadword Integersvpsubq=HvpsubqHvpsubq?HvpsubqHvpsubqAHvpsubqHvpsubq=Hvpsubq vpsubqHvpsubq/ vpsubq?Hvpsubq!vpsubqHvpsubq2!vpsubqAHvpsubqHMULMULmulUnsigned MultiplymulbMULB mulwMULW mullMULLmulbMULB#mulwMULW$mullMULL'%https://www.felixcloutier.com/x86/mulVRCPPHVRCPPHvrcpphNCompute Approximate Reciprocals of Packed Half-Precision Floating-Point Values vrcpph<Kvrcpph>Kvrcpph@RvrcpphKvrcpphKvrcpphRvrcpph<KvrcpphKvrcpph>KvrcpphKvrcpph@RvrcpphR(https://www.felixcloutier.com/x86/vrcpphVUCOMISSVUCOMISSvucomissNUnordered Compare Scalar Single-Precision Floating-Point Values and Set EFLAGSvucomiss vucomissHvucomiss' vucomiss'HvucomissRHSETGESETGEsetge'Set byte if greater or equal (SF == OF)setgeSETGE setgeSETGE# VPERMIL2PS VPERMIL2PS vpermil2ps:Permute Two-Source Single-Precision Floating-Point Vectors vpermil2ps" vpermil2ps/" vpermil2ps/" vpermil2ps" vpermil2ps2" vpermil2ps2"AAMAAMaamASCII Adjust AX After MultiplyaamAAMaamAAM%https://www.felixcloutier.com/x86/aam VBLENDVPD VBLENDVPD vblendvpd= Variable Blend Packed Double Precision Floating-Point Values vblendvpd  vblendvpd/  vblendvpd  vblendvpd2 VRCP28SDVRCP28SDvrcp28sduApproximation to the Reciprocal of a Scalar Double-Precision Floating-Point Value with Less Than 2^-28 Relative Errorvrcp28sdMvrcp28sd+Mvrcp28sdMvrcp28sd+Mvrcp28sdRMvrcp28sdRM*https://www.felixcloutier.com/x86/vrcp28sd PCLMULQDQ PCLMULQDQ pclmulqdq"Carry-Less Quadword Multiplication pclmulqdq& pclmulqdq/&+https://www.felixcloutier.com/x86/pclmulqdqCMOVNOCMOVNOcmovnoMove if not overflow (OF == 0)cmovnow  cmovnow $cmovnolcmovnol'JNLEJNLEjnle0Jump if not less or equal (ZF == 0 and SF == OF)jnleJGTNjnleJGTOPFNACCPFNACCpfnacc)Packed Floating-Point Negative Accumulatepfnaccpfnacc+DASDASdas#Decimal Adjust AL after SubtractiondasDAS%https://www.felixcloutier.com/x86/dasCMOVNLCMOVNLcmovnlMove if not less (SF == OF)cmovnlw  cmovnlw $cmovnllcmovnll'PMOVZXWQPMOVZXWQpmovzxwqBMove Packed Word Integers to Quadword Integers with Zero Extensionpmovzxwqpmovzxwq'KUNPCKBWKUNPCKBWkunpckbw!Unpack and Interleave 8-bit MaskskunpckbwH<https://www.felixcloutier.com/x86/kunpckbw:kunpckwd:kunpckdq PREFETCHT2 PREFETCHT2 prefetcht2'Prefetch Data Into Caches using T2 Hint prefetcht2 PREFETCHT2#  VFNMSUB213SS VFNMSUB213SS vfnmsub213ssQFused Negative Multiply-Subtract of Scalar Single-Precision Floating-Point Values vfnmsub213ssH vfnmsub213ss'H vfnmsub213ss# vfnmsub213ssH vfnmsub213ss'# vfnmsub213ss'H vfnmsub213ssQH vfnmsub213ssQHHhttps://www.felixcloutier.com/x86/vfnmsub132ss:vfnmsub213ss:vfnmsub231ssVPAVGBVPAVGBvpavgbAverage Packed Byte IntegersvpavgbIvpavgb/IvpavgbIvpavgb2IvpavgbIvpavgb5Ivpavgb vpavgbIvpavgb/ vpavgb/Ivpavgb!vpavgbIvpavgb2!vpavgb2IvpavgbIvpavgb5I VPUNPCKLBW VPUNPCKLBW vpunpcklbw0Unpack and Interleave Low-Order Bytes into Words vpunpcklbwI vpunpcklbw/I vpunpcklbwI vpunpcklbw2I vpunpcklbwI vpunpcklbw5I vpunpcklbw  vpunpcklbwI vpunpcklbw/  vpunpcklbw/I vpunpcklbw! vpunpcklbwI vpunpcklbw2! vpunpcklbw2I vpunpcklbwI vpunpcklbw5IMULSDMULSDmulsd6Multiply Scalar Double-Precision Floating-Point ValuesmulsdMULSDmulsdMULSD+'https://www.felixcloutier.com/x86/mulsdVPERMWVPERMWvpermwPermute Word Integers vpermwIvpermw/IvpermwIvpermw2IvpermwIvpermw5IvpermwIvpermw/IvpermwIvpermw2IvpermwIvpermw5I/https://www.felixcloutier.com/x86/vpermd:vpermwVPHADDWVPHADDWvphaddw#Packed Horizontal Add Word Integersvphaddw vphaddw/ vphaddw!vphaddw2! VFNMSUBPS VFNMSUBPS vfnmsubpsQFused Negative Multiply-Subtract of Packed Single-Precision Floating-Point Values vfnmsubps$ vfnmsubps/$ vfnmsubps/$ vfnmsubps$ vfnmsubps2$ vfnmsubps2$VPANDNDVPANDNDvpandnd5Bitwise Logical AND NOT of Packed Doubleword Integers vpandnd9HvpandndHvpandnd:HvpandndHvpandnd;HvpandndHvpandnd9HvpandndHvpandnd:HvpandndHvpandnd;HvpandndHVSCATTERPF0DPDVSCATTERPF0DPDvscatterpf0dpd„Sparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Doubleword Indices Using T0 Hint with Intent to Writevscatterpf0dpdGL]https://www.felixcloutier.com/x86/vscatterpf0dps:vscatterpf0qps:vscatterpf0dpd:vscatterpf0qpdVCVTNEEBF162PSVCVTNEEBF162PSvcvtneebf162ps:Convert Even Elements of Packed BF16 Values to FP32 Valuesvcvtneebf162ps/Zvcvtneebf162ps2ZVPCOMBVPCOMBvpcomb#Compare Packed Signed Byte Integersvpcomb"vpcomb/" VSHA512MSG1 VSHA512MSG1 vsha512msg1NPerform an Intermediate Calculation for the Next Four SHA512 Message Quadwords vsha512msg1)JGJGjg&Jump if greater (ZF == 0 and SF == OF)jgJGTNjgJGTOKUNPCKDQKUNPCKDQkunpckdq"Unpack and Interleave 32-bit MaskskunpckdqI<https://www.felixcloutier.com/x86/kunpckbw:kunpckwd:kunpckdqVPMOVW2MVPMOVW2Mvpmovw2m3Move Signs of Packed Word Integers to Mask Registervpmovw2mIvpmovw2mIvpmovw2mIEhttps://www.felixcloutier.com/x86/vpmovb2m:vpmovw2m:vpmovd2m:vpmovq2mVRCPSHVRCPSHvrcpshMCompute Approximate Reciprocal of Scalar Half-Precision Floating-Point ValuesvrcpshRvrcpsh$RvrcpshRvrcpsh$R(https://www.felixcloutier.com/x86/vrcpshPSUBBPSUBBpsubbSubtract Packed Byte IntegerspsubbPSUBB psubbPSUBB+ psubbPSUBBpsubbPSUBB/3https://www.felixcloutier.com/x86/psubb:psubw:psubdVRCP14SSVRCP14SSvrcp14ssPCompute Approximate Reciprocal of a Scalar Single-Precision Floating-Point Valuevrcp14ssHvrcp14ss'Hvrcp14ssHvrcp14ss'H*https://www.felixcloutier.com/x86/vrcp14ssCRC32CRC32crc32Accumulate CRC32 Valuecrc32b crc32w crc32lcrc32b#crc32w$crc32l''https://www.felixcloutier.com/x86/crc32SETNLSETNLsetnlSet byte if not less (SF == OF)setnlSETGE setnlSETGE#VPMINSQVPMINSQvpminsq*Minimum of Packed Signed Quadword Integers vpminsq=HvpminsqHvpminsq?HvpminsqHvpminsqAHvpminsqHvpminsq=HvpminsqHvpminsq?HvpminsqHvpminsqAHvpminsqHLFENCELFENCElfence Load FencelfenceLFENCE(https://www.felixcloutier.com/x86/lfenceVPERMT2WVPERMT2Wvpermt2w9Full Permute of Words From Two Tables Overwriting a Table vpermt2wIvpermt2w/Ivpermt2wIvpermt2w2Ivpermt2wIvpermt2w5Ivpermt2wIvpermt2w/Ivpermt2wIvpermt2w2Ivpermt2wIvpermt2w5IPhttps://www.felixcloutier.com/x86/vpermt2w:vpermt2d:vpermt2q:vpermt2ps:vpermt2pdVPMULHWVPMULHWvpmulhw:Multiply Packed Signed Word Integers and Store High ResultvpmulhwIvpmulhw/IvpmulhwIvpmulhw2IvpmulhwIvpmulhw5Ivpmulhw vpmulhwIvpmulhw/ vpmulhw/Ivpmulhw!vpmulhwIvpmulhw2!vpmulhw2IvpmulhwIvpmulhw5I MASKMOVDQU MASKMOVDQU maskmovdqu'Store Selected Bytes of Double Quadword maskmovdqu,https://www.felixcloutier.com/x86/maskmovdquVPMOVQWVPMOVQWvpmovqwBDown Convert Packed Quadword Values to Word Values with Truncation vpmovqwHvpmovqw(HvpmovqwHvpmovqw,HvpmovqwHvpmovqw0HvpmovqwHvpmovqwHvpmovqwHvpmovqw'Hvpmovqw+Hvpmovqw/H<https://www.felixcloutier.com/x86/vpmovqw:vpmovsqw:vpmovusqwCOMISDCOMISDcomisdLCompare Scalar Ordered Double-Precision Floating-Point Values and Set EFLAGScomisdCOMISDcomisdCOMISD+(https://www.felixcloutier.com/x86/comisdVPROLVQVPROLVQvprolvq$Variable Rotate Packed Quadword Left vprolvq=HvprolvqHvprolvq?HvprolvqHvprolvqAHvprolvqHvprolvq=HvprolvqHvprolvq?HvprolvqHvprolvqAHvprolvqH?https://www.felixcloutier.com/x86/vprold:vprolvd:vprolq:vprolvq VEXTRACTF128 VEXTRACTF128 vextractf128$Extract Packed Floating-Point Values vextractf128  vextractf128/ fhttps://www.felixcloutier.com/x86/vextractf128:vextractf32x4:vextractf64x2:vextractf32x8:vextractf64x4PMULDQPMULDQpmuldqDMultiply Packed Signed Doubleword Integers and Store Quadword Resultpmuldqpmuldq/(https://www.felixcloutier.com/x86/pmuldqSETNGSETNGsetng-Set byte if not greater (ZF == 1 or SF != OF)setngSETLE setngSETLE#VPSHLDDVPSHLDDvpshldd9Concatenate and Shift Packed Doubleword Data Left Logical vpshldd9KvpshlddKvpshldd:KvpshlddKvpshldd;UvpshlddUvpshldd9KvpshlddKvpshldd:KvpshlddKvpshldd;UvpshlddU VSHUFF32X4 VSHUFF32X4 vshuff32x4=Shuffle 128-Bit Packed Single-Precision Floating-Point Values vshuff32x4:H vshuff32x4H vshuff32x4;H vshuff32x4H vshuff32x4:H vshuff32x4H vshuff32x4;H vshuff32x4HVMOVNTDQVMOVNTDQvmovntdq-Store Double Quadword Using Non-Temporal Hintvmovntdq/ vmovntdq/Hvmovntdq2 vmovntdq2Hvmovntdq5HVMAXPDVMAXPDvmaxpd<Return Maximum Packed Double-Precision Floating-Point Valuesvmaxpd=HvmaxpdHvmaxpd?HvmaxpdHvmaxpdAHvmaxpdHvmaxpd=Hvmaxpd vmaxpdHvmaxpd/ vmaxpd?Hvmaxpd vmaxpdHvmaxpd2 vmaxpdAHvmaxpdHvmaxpdRHvmaxpdRH VPBLENDVB VPBLENDVB vpblendvbVariable Blend Packed Bytes vpblendvb  vpblendvb/  vpblendvb! vpblendvb2!VPINSRWVPINSRWvpinsrw Insert Wordvpinsrw vpinsrwIvpinsrw$ vpinsrw$ICMOVNCCMOVNCcmovncMove if not carry (CF == 0)cmovncw  cmovncw $cmovnclcmovncl' VUNPCKHPD VUNPCKHPD vunpckhpdHUnpack and Interleave High Packed Double-Precision Floating-Point Values vunpckhpd=H vunpckhpdH vunpckhpd?H vunpckhpdH vunpckhpdAH vunpckhpdH vunpckhpd=H vunpckhpd  vunpckhpdH vunpckhpd/  vunpckhpd?H vunpckhpd  vunpckhpdH vunpckhpd2  vunpckhpdAH vunpckhpdHSETASETAseta'Set byte if above (CF == 0 and ZF == 0)setaSETHI setaSETHI#JLEJLEjle+Jump if less or equal (ZF == 1 or SF != OF)jleJLENjleJLEOKANDQKANDQkandq Bitwise Logical AND 64-bit MaskskandqI9https://www.felixcloutier.com/x86/kandw:kandb:kandq:kandd VMOVSLDUP VMOVSLDUP vmovsldup'Move Packed Single-FP Low and Duplicate vmovsldupH vmovsldupH vmovsldupH vmovsldup/H vmovsldup2H vmovsldup5H vmovsldup  vmovsldupH vmovsldup/  vmovsldup/H vmovsldup  vmovsldupH vmovsldup2  vmovsldup2H vmovsldupH vmovsldup5H VPERMI2PS VPERMI2PS vpermi2ps\Full Permute of Single-Precision Floating-Point Values From Two Tables Overwriting the Index  vpermi2ps9H vpermi2psH vpermi2ps:H vpermi2psH vpermi2ps;H vpermi2psH vpermi2ps9H vpermi2psH vpermi2ps:H vpermi2psH vpermi2ps;H vpermi2psHPhttps://www.felixcloutier.com/x86/vpermi2w:vpermi2d:vpermi2q:vpermi2ps:vpermi2pdSHRXSHRXshrx+Logical Shift Right Without Affecting Flagsshrxl5shrxl'50https://www.felixcloutier.com/x86/sarx:shlx:shrx VFPCLASSSD VFPCLASSSD vfpclasssd:Test Class of Scalar Double-Precision Floating-Point Value vfpclasssdJ vfpclasssdJ vfpclasssd+J vfpclasssd+J,https://www.felixcloutier.com/x86/vfpclasssdJNSJNSjnsJump if not sign (SF == 0)jnsJPLNjnsJPLO VCVTUSI2SD VCVTUSI2SD vcvtusi2sdHConvert Unsigned Integer to Scalar Double-Precision Floating-Point Value vcvtusi2sdlH vcvtusi2sdl'H,https://www.felixcloutier.com/x86/vcvtusi2sdUNPCKLPDUNPCKLPDunpcklpdGUnpack and Interleave Low Packed Double-Precision Floating-Point ValuesunpcklpdUNPCKLPDunpcklpdUNPCKLPD/*https://www.felixcloutier.com/x86/unpcklpd VFCMADDCSH VFCMADDCSH vfcmaddcshSFused Conjugate Multiply-Add of Complex Scalar Half-Precision Floating-Point Values vfcmaddcshR vfcmaddcsh'R vfcmaddcshR vfcmaddcsh'R vfcmaddcshQR vfcmaddcshQR6https://www.felixcloutier.com/x86/vfcmaddcsh:vfmaddcshDPPSDPPSdpps<Dot Product of Packed Single Precision Floating-Point Valuesdppsdpps/&https://www.felixcloutier.com/x86/dppsJOJOjoJump if overflow (OF == 1)joJOSNjoJOSOKTESTBKTESTBktestb"Bit Test 8-bit Masks and Set FlagsktestbJ=https://www.felixcloutier.com/x86/ktestw:ktestb:ktestq:ktestd VGETEXPSD VGETEXPSD vgetexpsdiExtract Exponent of Scalar Double-Precision Floating-Point Value as Double-Precision Floating-Point Value vgetexpsdH vgetexpsd+H vgetexpsdH vgetexpsd+H vgetexpsdRH vgetexpsdRH+https://www.felixcloutier.com/x86/vgetexpsdVLDMXCSRVLDMXCSRvldmxcsrLoad MXCSR Registervldmxcsr' VPROTWVPROTWvprotwPacked Rotate Wordsvprotw"vprotw"vprotw/"vprotw/"vprotw/"VPDPWSUDVPDPWSUDvpdpwsudHPacked Dot Product of Signed-by-Unsigned Word subvectors into DoublewordvpdpwsudYvpdpwsud/YvpdpwsudYvpdpwsud2YMINSDMINSDminsd;Return Minimum Scalar Double-Precision Floating-Point ValueminsdMINSDminsdMINSD+'https://www.felixcloutier.com/x86/minsdJBJBjbJump if below (CF == 1)jbJCSNjbJCSO PMADDUBSW PMADDUBSW pmaddubsw9Multiply and Add Packed Signed and Unsigned Byte Integers pmaddubsw pmaddubsw+ pmaddubsw pmaddubsw/+https://www.felixcloutier.com/x86/pmaddubswVPERMPSVPERMPSvpermps0Permute Single-Precision Floating-Point Elements vpermps:HvpermpsHvpermps;HvpermpsHvpermps:Hvpermps!vpermpsHvpermps2!vpermps;HvpermpsH)https://www.felixcloutier.com/x86/vpermps VCVTPH2PS VCVTPH2PS vcvtph2ps>Convert Half-Precision FP Values to Single-Precision FP Values vcvtph2psH vcvtph2psH vcvtph2psH vcvtph2ps+H vcvtph2ps/H vcvtph2ps2H vcvtph2ps% vcvtph2psH vcvtph2ps+% vcvtph2ps+H vcvtph2ps% vcvtph2psH vcvtph2ps/% vcvtph2ps/H vcvtph2psH vcvtph2ps2H vcvtph2psRH vcvtph2psRH6https://www.felixcloutier.com/x86/vcvtph2ps:vcvtph2psxAAAAAAaaaASCII Adjust After AdditionaaaAAA%https://www.felixcloutier.com/x86/aaaVFMADDPSVFMADDPSvfmaddpsCFused Multiply-Add of Packed Single-Precision Floating-Point Valuesvfmaddps$vfmaddps/$vfmaddps/$vfmaddps$vfmaddps2$vfmaddps2$MOVUPSMOVUPSmovups<Move Unaligned Packed Single-Precision Floating-Point ValuesmovupsMOVUPSmovupsMOVUPS/movupsMOVUPS/(https://www.felixcloutier.com/x86/movupsPEXTPEXTpextParallel Bits Extractpextl5pextl'5&https://www.felixcloutier.com/x86/pext VFMADDCPH VFMADDCPH vfmaddcphIFused Multiply-Add of Complex Packed Half-Precision Floating-Point Values vfmaddcph9K vfmaddcphK vfmaddcph:K vfmaddcphK vfmaddcph;R vfmaddcphR vfmaddcph9K vfmaddcphK vfmaddcph:K vfmaddcphK vfmaddcph;R vfmaddcphR vfmaddcphQR vfmaddcphQR6https://www.felixcloutier.com/x86/vfcmaddcph:vfmaddcphVPSHRDVWVPSHRDVWvpshrdvw=Concatenate and Variable Shift Packed Word Data Right Logical vpshrdvwKvpshrdvw/KvpshrdvwKvpshrdvw2KvpshrdvwUvpshrdvw5UvpshrdvwKvpshrdvw/KvpshrdvwKvpshrdvw2KvpshrdvwUvpshrdvw5UINSERTQINSERTQinsertq Insert FieldinsertqinsertqCMOVACMOVAcmova#Move if above (CF == 0 and ZF == 0)cmovaw  cmovaw $cmovalcmoval' VAESDECLAST VAESDECLAST vaesdeclast,Perform Last Round of an AES Decryption Flow  vaesdeclast  vaesdeclastK vaesdeclast/  vaesdeclast/K vaesdeclast vaesdeclastK vaesdeclast2 vaesdeclast2K vaesdeclastH vaesdeclast5HVDIVSHVDIVSHvdivsh2Divide Scalar Half-Precision Floating-Point ValuesvdivshRvdivsh$RvdivshRvdivsh$RvdivshQRvdivshQR(https://www.felixcloutier.com/x86/vdivsh VGATHERPF0QPD VGATHERPF0QPD vgatherpf0qpdmSparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Quadword Indices Using T0 Hint vgatherpf0qpdMLYhttps://www.felixcloutier.com/x86/vgatherpf0dps:vgatherpf0qps:vgatherpf0dpd:vgatherpf0qpd VINSERTI64X2 VINSERTI64X2 vinserti64x21Insert 128 Bits of Packed Quadword Integer Values vinserti64x2J vinserti64x2/J vinserti64x2J vinserti64x2/J vinserti64x2J vinserti64x2/J vinserti64x2J vinserti64x2/JVPCMPEQWVPCMPEQWvpcmpeqw%Compare Packed Word Data for EqualityvpcmpeqwIvpcmpeqwIvpcmpeqw/Ivpcmpeqw/IvpcmpeqwIvpcmpeqwIvpcmpeqw2Ivpcmpeqw2IvpcmpeqwIvpcmpeqwIvpcmpeqw5Ivpcmpeqw5Ivpcmpeqw vpcmpeqw/ vpcmpeqw!vpcmpeqw2!KMOVWKMOVWkmovwMove 16-bit MaskkmovwHkmovwHkmovw$HkmovwHkmovw$H9https://www.felixcloutier.com/x86/kmovw:kmovb:kmovq:kmovdMOVAPSMOVAPSmovaps:Move Aligned Packed Single-Precision Floating-Point ValuesmovapsMOVAPSmovapsMOVAPS/movapsMOVAPS/(https://www.felixcloutier.com/x86/movapsCVTPS2PICVTPS2PIcvtps2piBConvert Packed Single-Precision FP Values to Packed Dword Integerscvtps2piCVTPS2PLcvtps2piCVTPS2PL+*https://www.felixcloutier.com/x86/cvtps2piBEXTRBEXTRbextrBit Field Extractbextrl6bextrl4bextrl'6bextrl'4'https://www.felixcloutier.com/x86/bextrPHSUBWPHSUBWphsubw(Packed Horizontal Subtract Word Integersphsubwphsubw+phsubwphsubw//https://www.felixcloutier.com/x86/phsubw:phsubd PREFETCHT1 PREFETCHT1 prefetcht1'Prefetch Data Into Caches using T1 Hint prefetcht1 PREFETCHT1#  VGATHERQPD VGATHERQPD vgatherqpdRGather Packed Double-Precision Floating-Point Values Using Signed Quadword Indices vgatherqpdDH vgatherqpdHH vgatherqpdLH vgatherqpdD! vgatherqpdH!7https://www.felixcloutier.com/x86/vgatherqps:vgatherqpd VPDPBUSDS VPDPBUSDS vpdpbusdsXPacked Dot Product of Unsigned-by-Singed Byte subvectors into Doubleword with Saturation vpdpbusds9K vpdpbusdsK vpdpbusds:K vpdpbusdsK vpdpbusds;V vpdpbusdsV vpdpbusds9K vpdpbusdsW vpdpbusdsK vpdpbusds/W vpdpbusds:K vpdpbusdsW vpdpbusdsK vpdpbusds2W vpdpbusds;V vpdpbusdsV+https://www.felixcloutier.com/x86/vpdpbusdsINT3INT3int3Interrupt 3 (debug trap)int35https://www.felixcloutier.com/x86/intn:into:int3:int1 VCVTSD2SS VCVTSD2SS vcvtsd2ssLConvert Scalar Double-Precision FP Value to Scalar Single-Precision FP Value vcvtsd2ssH vcvtsd2ss+H vcvtsd2ss  vcvtsd2ssH vcvtsd2ss+  vcvtsd2ss+H vcvtsd2ssQH vcvtsd2ssQHVPMULLWVPMULLWvpmullw9Multiply Packed Signed Word Integers and Store Low ResultvpmullwIvpmullw/IvpmullwIvpmullw2IvpmullwIvpmullw5Ivpmullw vpmullwIvpmullw/ vpmullw/Ivpmullw!vpmullwIvpmullw2!vpmullw2IvpmullwIvpmullw5IPSRLQPSRLQpsrlq(Shift Packed Quadword Data Right Logicalpsrlq psrlq psrlq+ psrlqpsrlqpsrlq/3https://www.felixcloutier.com/x86/psrlw:psrld:psrlqAASAASaas!ASCII Adjust AL After SubtractionaasAAS%https://www.felixcloutier.com/x86/aas VRNDSCALEPS VRNDSCALEPS vrndscaleps^Round Packed Single-Precision Floating-Point Values To Include A Given Number Of Fraction Bits vrndscaleps9H vrndscaleps:H vrndscaleps;H vrndscalepsH vrndscalepsH vrndscalepsH vrndscaleps9H vrndscalepsH vrndscaleps:H vrndscalepsH vrndscaleps;H vrndscalepsH vrndscalepsRH vrndscalepsRH-https://www.felixcloutier.com/x86/vrndscalepsVPMINSWVPMINSWvpminsw&Minimum of Packed Signed Word IntegersvpminswIvpminsw/IvpminswIvpminsw2IvpminswIvpminsw5Ivpminsw vpminswIvpminsw/ vpminsw/Ivpminsw!vpminswIvpminsw2!vpminsw2IvpminswIvpminsw5IAXORAXORaxorAtomically XORaxor'VBROADCASTF128VBROADCASTF128vbroadcastf128(Broadcast 128 Bit of Floating-Point Datavbroadcastf128/  VCVTPD2DQ VCVTPD2DQ vcvtpd2dqBConvert Packed Double-Precision FP Values to Packed Dword Integers vcvtpd2dqx=H vcvtpd2dqy?H vcvtpd2dqAH vcvtpd2dqxH vcvtpd2dqyH vcvtpd2dqH vcvtpd2dqx=H vcvtpd2dqy?H vcvtpd2dqx  vcvtpd2dqxH vcvtpd2dqy  vcvtpd2dqyH vcvtpd2dqx/  vcvtpd2dqy2  vcvtpd2dqAH vcvtpd2dqH vcvtpd2dqQH vcvtpd2dqQH VFMSUB231PS VFMSUB231PS vfmsub231psHFused Multiply-Subtract of Packed Single-Precision Floating-Point Values vfmsub231ps9H vfmsub231psH vfmsub231ps:H vfmsub231psH vfmsub231ps;H vfmsub231psH vfmsub231ps9H vfmsub231ps# vfmsub231psH vfmsub231ps/# vfmsub231ps:H vfmsub231ps# vfmsub231psH vfmsub231ps2# vfmsub231ps;H vfmsub231psH vfmsub231psQH vfmsub231psQHEhttps://www.felixcloutier.com/x86/vfmsub132ps:vfmsub213ps:vfmsub231psMOVHPSMOVHPSmovhps7Move High Packed Single-Precision Floating-Point ValuesmovhpsMOVHPS+movhpsMOVHPS+(https://www.felixcloutier.com/x86/movhpsVPDPWSSDVPDPWSSDvpdpwssdFPacked Dot Product of Signed-by-Signed Word subvectors into Doublewordvpdpwssd9KvpdpwssdKvpdpwssd:KvpdpwssdKvpdpwssd;VvpdpwssdVvpdpwssd9KvpdpwssdWvpdpwssdKvpdpwssd/Wvpdpwssd:KvpdpwssdWvpdpwssdKvpdpwssd2Wvpdpwssd;VvpdpwssdV*https://www.felixcloutier.com/x86/vpdpwssdVPHSUBSWVPHSUBSWvphsubswFPacked Horizontal Subtract Signed Word Integers with Signed Saturationvphsubsw vphsubsw/ vphsubsw!vphsubsw2!KNOTBKNOTBknotbNOT 8-bit Mask RegisterknotbJ9https://www.felixcloutier.com/x86/knotw:knotb:knotq:knotdPF2IWPF2IWpf2iw0Packed Floating-Point to Integer Word Conversionpf2iwpf2iw+ VFIXUPIMMSS VFIXUPIMMSS vfixupimmss;Fix Up Special Scalar Single-Precision Floating-Point Value vfixupimmssH vfixupimmss'H vfixupimmssH vfixupimmss'H vfixupimmssRH vfixupimmssRH-https://www.felixcloutier.com/x86/vfixupimmssGF2P8AFFINEINVQBGF2P8AFFINEINVQBgf2p8affineinvqb0Galois Field (2^8) Affine Inverse Transformationgf2p8affineinvqbgf2p8affineinvqb/2https://www.felixcloutier.com/x86/gf2p8affineinvqbMOVSDMOVSDmovsd1Move Scalar Double-Precision Floating-Point ValuemovsdMOVSDmovsdMOVSD+movsdMOVSD+'https://www.felixcloutier.com/x86/movsdVBROADCASTI64X2VBROADCASTI64X2vbroadcasti64x2Broadcast Two Quadword Elementsvbroadcasti64x2/Jvbroadcasti64x2/Jvbroadcasti64x2/Jvbroadcasti64x2/JVMAXPSVMAXPSvmaxps<Return Maximum Packed Single-Precision Floating-Point Valuesvmaxps9HvmaxpsHvmaxps:HvmaxpsHvmaxps;HvmaxpsHvmaxps9Hvmaxps vmaxpsHvmaxps/ vmaxps:Hvmaxps vmaxpsHvmaxps2 vmaxps;HvmaxpsHvmaxpsRHvmaxpsRHCVTPD2DQCVTPD2DQcvtpd2dqBConvert Packed Double-Precision FP Values to Packed Dword Integerscvtpd2dqcvtpd2dq/*https://www.felixcloutier.com/x86/cvtpd2dqVPMOVSDBVPMOVSDBvpmovsdbKDown Convert Packed Doubleword Values to Byte Values with Signed Saturation vpmovsdbHvpmovsdb(HvpmovsdbHvpmovsdb,HvpmovsdbHvpmovsdb0HvpmovsdbHvpmovsdbHvpmovsdbHvpmovsdb'Hvpmovsdb+Hvpmovsdb/H<https://www.felixcloutier.com/x86/vpmovdb:vpmovsdb:vpmovusdb VFNMSUB132PD VFNMSUB132PD vfnmsub132pdQFused Negative Multiply-Subtract of Packed Double-Precision Floating-Point Values vfnmsub132pd=H vfnmsub132pdH vfnmsub132pd?H vfnmsub132pdH vfnmsub132pdAH vfnmsub132pdH vfnmsub132pd=H vfnmsub132pd# vfnmsub132pdH vfnmsub132pd/# vfnmsub132pd?H vfnmsub132pd# vfnmsub132pdH vfnmsub132pd2# vfnmsub132pdAH vfnmsub132pdH vfnmsub132pdQH vfnmsub132pdQHHhttps://www.felixcloutier.com/x86/vfnmsub132pd:vfnmsub213pd:vfnmsub231pd VCVTPH2UW VCVTPH2UW vcvtph2uwZConvert Packed Half-Precision Floating-Point Values to Packed Unsigned Word Integer Values vcvtph2uw<K vcvtph2uw>K vcvtph2uw@R vcvtph2uwK vcvtph2uwK vcvtph2uwR vcvtph2uw<K vcvtph2uwK vcvtph2uw>K vcvtph2uwK vcvtph2uw@R vcvtph2uwR vcvtph2uwQR vcvtph2uwQR+https://www.felixcloutier.com/x86/vcvtph2uw VINSERTI32X4 VINSERTI32X4 vinserti32x43Insert 128 Bits of Packed Doubleword Integer Values vinserti32x4H vinserti32x4/H vinserti32x4H vinserti32x4/H vinserti32x4H vinserti32x4/H vinserti32x4H vinserti32x4/H VPERMILPD VPERMILPD vpermilpd.Permute Double-Precision Floating-Point Values  vpermilpd=H vpermilpd?H vpermilpdAH vpermilpd=H vpermilpdH vpermilpdH vpermilpd?H vpermilpdH vpermilpdH vpermilpdAH vpermilpdH vpermilpdH vpermilpd=H vpermilpd=H vpermilpd  vpermilpdH vpermilpd  vpermilpdH vpermilpd/  vpermilpd/  vpermilpd?H vpermilpd?H vpermilpd  vpermilpdH vpermilpd  vpermilpdH vpermilpd2  vpermilpd2  vpermilpdAH vpermilpdAH vpermilpdH vpermilpdH+https://www.felixcloutier.com/x86/vpermilpdVPTESTMBVPTESTMBvptestmb6Logical AND of Packed Byte Integer Values and Set Mask vptestmbIvptestmbIvptestmb/Ivptestmb/IvptestmbIvptestmbIvptestmb2Ivptestmb2IvptestmbIvptestmbIvptestmb5Ivptestmb5IEhttps://www.felixcloutier.com/x86/vptestmb:vptestmw:vptestmd:vptestmqVMOVDQUVMOVDQUvmovdquMove Unaligned Double Quadwordvmovdqu vmovdqu/ vmovdqu vmovdqu2 vmovdqu/ vmovdqu2 VPMOVM2BVPMOVM2Bvpmovm2b4Expand Bits of Mask Register to Packed Byte Integersvpmovm2bIvpmovm2bIvpmovm2bIEhttps://www.felixcloutier.com/x86/vpmovm2b:vpmovm2w:vpmovm2d:vpmovm2qVROUNDSDVROUNDSDvroundsd3Round Scalar Double Precision Floating-Point Valuesvroundsd vroundsd+ VPDPWUUDVPDPWUUDvpdpwuudJPacked Dot Product of Unsigned-by-Unsigned Word subvectors into DoublewordvpdpwuudYvpdpwuud/YvpdpwuudYvpdpwuud2Y VMOVDQU64 VMOVDQU64 vmovdqu64Move Unaligned Quadword Values vmovdqu640H vmovdqu64H vmovdqu643H vmovdqu64H vmovdqu646H vmovdqu64H vmovdqu64/H vmovdqu642H vmovdqu645H vmovdqu64H vmovdqu64/H vmovdqu64H vmovdqu642H vmovdqu64H vmovdqu645H vmovdqu64/H vmovdqu642H vmovdqu645HOhttps://www.felixcloutier.com/x86/movdqu:vmovdqu8:vmovdqu16:vmovdqu32:vmovdqu64PANDPANDpandPacked Bitwise Logical ANDpandPAND pandPAND+ pandPANDpandPAND/&https://www.felixcloutier.com/x86/pand VPMOVUSWB VPMOVUSWB vpmovuswbGDown Convert Packed Word Values to Byte Values with Unsigned Saturation  vpmovuswbI vpmovuswb,I vpmovuswbI vpmovuswb0I vpmovuswbI vpmovuswb3I vpmovuswbI vpmovuswbI vpmovuswbI vpmovuswb+I vpmovuswb/I vpmovuswb2I<https://www.felixcloutier.com/x86/vpmovwb:vpmovswb:vpmovuswb VSCALEFSS VSCALEFSS vscalefss_Scale Scalar Single-Precision Floating-Point Value With a Single-Precision Floating-Point Value vscalefssH vscalefss'H vscalefssH vscalefss'H vscalefssQH vscalefssQH+https://www.felixcloutier.com/x86/vscalefssVMOVAPDVMOVAPDvmovapd:Move Aligned Packed Double-Precision Floating-Point Valuesvmovapd0HvmovapdHvmovapd3HvmovapdHvmovapd6HvmovapdHvmovapd/Hvmovapd2Hvmovapd5Hvmovapd vmovapdHvmovapd/ vmovapd/Hvmovapd vmovapdHvmovapd2 vmovapd2HvmovapdHvmovapd5Hvmovapd/ vmovapd/Hvmovapd2 vmovapd2Hvmovapd5HSARXSARXsarx.Arithmetic Shift Right Without Affecting Flagssarxl5sarxl'50https://www.felixcloutier.com/x86/sarx:shlx:shrx VINSERTPS VINSERTPS vinsertps3Insert Packed Single Precision Floating-Point Value vinsertps  vinsertpsH vinsertps'  vinsertps'H VPMOVSXBD VPMOVSXBD vpmovsxbdDMove Packed Byte Integers to Doubleword Integers with Sign Extension vpmovsxbdH vpmovsxbdH vpmovsxbdH vpmovsxbd'H vpmovsxbd+H vpmovsxbd/H vpmovsxbd  vpmovsxbdH vpmovsxbd'  vpmovsxbd'H vpmovsxbd! vpmovsxbdH vpmovsxbd+! vpmovsxbd+H vpmovsxbdH vpmovsxbd/HBLENDVPSBLENDVPSblendvps= Variable Blend Packed Single Precision Floating-Point Valuesblendvpsblendvps/*https://www.felixcloutier.com/x86/blendvps VFNMSUB231SS VFNMSUB231SS vfnmsub231ssQFused Negative Multiply-Subtract of Scalar Single-Precision Floating-Point Values vfnmsub231ssH vfnmsub231ss'H vfnmsub231ss# vfnmsub231ssH vfnmsub231ss'# vfnmsub231ss'H vfnmsub231ssQH vfnmsub231ssQHHhttps://www.felixcloutier.com/x86/vfnmsub132ss:vfnmsub213ss:vfnmsub231ssPSUBUSBPSUBUSBpsubusb?Subtract Packed Unsigned Byte Integers with Unsigned SaturationpsubusbPSUBUSB psubusbPSUBUSB+ psubusbPSUBUSBpsubusbPSUBUSB/1https://www.felixcloutier.com/x86/psubusb:psubusw VPERMILPS VPERMILPS vpermilps.Permute Single-Precision Floating-Point Values  vpermilps9H vpermilps:H vpermilps;H vpermilps9H vpermilpsH vpermilpsH vpermilps:H vpermilpsH vpermilpsH vpermilps;H vpermilpsH vpermilpsH vpermilps9H vpermilps9H vpermilps  vpermilpsH vpermilps  vpermilpsH vpermilps/  vpermilps/  vpermilps:H vpermilps:H vpermilps  vpermilpsH vpermilps  vpermilpsH vpermilps2  vpermilps2  vpermilps;H vpermilps;H vpermilpsH vpermilpsH+https://www.felixcloutier.com/x86/vpermilpsADCADCadcAdd with CarryadcbADCBadcbADCB adcbADCB  adcbADCB #adcwADCW adcwADCW adcwADCW adcwADCW  adcwADCW $adclADCLadclADCLadclADCLadclADCLadclADCL'adcbADCB#adcbADCB# adcwADCW$adcwADCW$adcwADCW$ adclADCL'adclADCL'adclADCL'%https://www.felixcloutier.com/x86/adcVPSLLDQVPSLLDQvpslldq)Shift Packed Double Quadword Left Logicalvpslldq vpslldqIvpslldq/Ivpslldq!vpslldqIvpslldq2IvpslldqIvpslldq5I VGATHERPF0DPS VGATHERPF0DPS vgatherpf0dpsoSparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Doubleword Indices Using T0 Hint vgatherpf0dpsKLYhttps://www.felixcloutier.com/x86/vgatherpf0dps:vgatherpf0qps:vgatherpf0dpd:vgatherpf0qpdVDPPSVDPPSvdpps<Dot Product of Packed Single Precision Floating-Point Valuesvdpps vdpps/ vdpps vdpps2 PFACCPFACCpfacc Packed Floating-Point Accumulatepfaccpfacc+VBROADCASTF64X2VBROADCASTF64X2vbroadcastf64x26Broadcast Two Double-Precision Floating-Point Elementsvbroadcastf64x2/Jvbroadcastf64x2/Jvbroadcastf64x2/Jvbroadcastf64x2/J PHMINPOSUW PHMINPOSUW phminposuw3Packed Horizontal Minimum of Unsigned Word Integers phminposuw phminposuw/,https://www.felixcloutier.com/x86/phminposuw VGETEXPSS VGETEXPSS vgetexpssiExtract Exponent of Scalar Single-Precision Floating-Point Value as Single-Precision Floating-Point Value vgetexpssH vgetexpss'H vgetexpssH vgetexpss'H vgetexpssRH vgetexpssRH+https://www.felixcloutier.com/x86/vgetexpssKANDNBKANDNBkandnb#Bitwise Logical AND NOT 8-bit MaskskandnbJ=https://www.felixcloutier.com/x86/kandnw:kandnb:kandnq:kandndCVTSS2SDCVTSS2SDcvtss2sdLConvert Scalar Single-Precision FP Value to Scalar Double-Precision FP Valuecvtss2sdCVTSS2SDcvtss2sdCVTSS2SD'*https://www.felixcloutier.com/x86/cvtss2sd VFMSUB132SS VFMSUB132SS vfmsub132ssHFused Multiply-Subtract of Scalar Single-Precision Floating-Point Values vfmsub132ssH vfmsub132ss'H vfmsub132ss# vfmsub132ssH vfmsub132ss'# vfmsub132ss'H vfmsub132ssQH vfmsub132ssQHEhttps://www.felixcloutier.com/x86/vfmsub132ss:vfmsub213ss:vfmsub231ssVPADDSBVPADDSBvpaddsb6Add Packed Signed Byte Integers with Signed SaturationvpaddsbIvpaddsb/IvpaddsbIvpaddsb2IvpaddsbIvpaddsb5Ivpaddsb vpaddsbIvpaddsb/ vpaddsb/Ivpaddsb!vpaddsbIvpaddsb2!vpaddsb2IvpaddsbIvpaddsb5I VPMOVUSDW VPMOVUSDW vpmovusdwMDown Convert Packed Doubleword Values to Word Values with Unsigned Saturation  vpmovusdwH vpmovusdw,H vpmovusdwH vpmovusdw0H vpmovusdwH vpmovusdw3H vpmovusdwH vpmovusdwH vpmovusdwH vpmovusdw+H vpmovusdw/H vpmovusdw2H<https://www.felixcloutier.com/x86/vpmovdw:vpmovsdw:vpmovusdw VFNMADD231SD VFNMADD231SD vfnmadd231sdLFused Negative Multiply-Add of Scalar Double-Precision Floating-Point Values vfnmadd231sdH vfnmadd231sd+H vfnmadd231sd# vfnmadd231sdH vfnmadd231sd+# vfnmadd231sd+H vfnmadd231sdQH vfnmadd231sdQHHhttps://www.felixcloutier.com/x86/vfnmadd132sd:vfnmadd213sd:vfnmadd231sdKADDDKADDDkadddADD Two 32-bit MaskskadddI9https://www.felixcloutier.com/x86/kaddw:kaddb:kaddq:kadddVPSLLWVPSLLWvpsllw#Shift Packed Word Data Left LogicalvpsllwIvpsllwIvpsllw/IvpsllwIvpsllwIvpsllw/IvpsllwIvpsllwIvpsllw/Ivpsllw/Ivpsllw2Ivpsllw5Ivpsllw vpsllwIvpsllw vpsllwIvpsllw/ vpsllw/Ivpsllw/Ivpsllw!vpsllwIvpsllw!vpsllwIvpsllw/!vpsllw/Ivpsllw2IvpsllwIvpsllwIvpsllw/Ivpsllw5I VFMSUB231PD VFMSUB231PD vfmsub231pdHFused Multiply-Subtract of Packed Double-Precision Floating-Point Values vfmsub231pd=H vfmsub231pdH vfmsub231pd?H vfmsub231pdH vfmsub231pdAH vfmsub231pdH vfmsub231pd=H vfmsub231pd# vfmsub231pdH vfmsub231pd/# vfmsub231pd?H vfmsub231pd# vfmsub231pdH vfmsub231pd2# vfmsub231pdAH vfmsub231pdH vfmsub231pdQH vfmsub231pdQHEhttps://www.felixcloutier.com/x86/vfmsub132pd:vfmsub213pd:vfmsub231pdVPMOVSQDVPMOVSQDvpmovsqdODown Convert Packed Quadword Values to Doubleword Values with Signed Saturation vpmovsqdHvpmovsqd,HvpmovsqdHvpmovsqd0HvpmovsqdHvpmovsqd3HvpmovsqdHvpmovsqdHvpmovsqdHvpmovsqd+Hvpmovsqd/Hvpmovsqd2H<https://www.felixcloutier.com/x86/vpmovqd:vpmovsqd:vpmovusqd VRNDSCALEPD VRNDSCALEPD vrndscalepd^Round Packed Double-Precision Floating-Point Values To Include A Given Number Of Fraction Bits vrndscalepd=H vrndscalepd?H vrndscalepdAH vrndscalepdH vrndscalepdH vrndscalepdH vrndscalepd=H vrndscalepdH vrndscalepd?H vrndscalepdH vrndscalepdAH vrndscalepdH vrndscalepdRH vrndscalepdRH-https://www.felixcloutier.com/x86/vrndscalepdVPSADBWVPSADBWvpsadbw#Compute Sum of Absolute Differences vpsadbw vpsadbwIvpsadbw/ vpsadbw/Ivpsadbw!vpsadbwIvpsadbw2!vpsadbw2IvpsadbwIvpsadbw5I VFMADD213PH VFMADD213PH vfmadd213phAFused Multiply-Add of Packed Half-Precision Floating-Point Values vfmadd213ph<K vfmadd213phK vfmadd213ph>K vfmadd213phK vfmadd213ph@R vfmadd213phR vfmadd213ph<K vfmadd213phK vfmadd213ph>K vfmadd213phK vfmadd213ph@R vfmadd213phR vfmadd213phQR vfmadd213phQRlhttps://www.felixcloutier.com/x86/vfmadd132ph:vfnmadd132ph:vfmadd213ph:vfnmadd213ph:vfmadd231ph:vfnmadd231ph VSM3RNDS2 VSM3RNDS2 vsm3rnds2#Perform Two Rounds of SM3 Operation vsm3rnds2 vsm3rnds2/VSTMXCSRVSTMXCSRvstmxcsrStore MXCSR Register Statevstmxcsr' SETLESETLEsetle/Set byte if less or equal (ZF == 1 or SF != OF)setleSETLE setleSETLE#MULXMULXmulx)Unsigned Multiply Without Affecting Flagsmulxl5mulxl'5&https://www.felixcloutier.com/x86/mulxVPOPCNTBVPOPCNTBvpopcntb)Packed Population Count for Byte Integers vpopcntbKvpopcntbKvpopcntbSvpopcntb/Kvpopcntb2Kvpopcntb5SvpopcntbKvpopcntb/KvpopcntbKvpopcntb2KvpopcntbSvpopcntb5SVUCOMISHVUCOMISHvucomishLUnordered Compare Scalar Half-Precision Floating-Point Values and Set EFLAGSvucomishRvucomish$RvucomishRR*https://www.felixcloutier.com/x86/vucomish VBLENDMPS VBLENDMPS vblendmpsLBlend Packed Single-Precision Floating-Point Vectors Using an OpMask Control  vblendmps9H vblendmpsH vblendmps:H vblendmpsH vblendmps;H vblendmpsH vblendmps9H vblendmpsH vblendmps:H vblendmpsH vblendmps;H vblendmpsH5https://www.felixcloutier.com/x86/vblendmpd:vblendmpsXCHGXCHGxchg&Exchange Register/Memory with Register xchgbXCHGB  xchgbXCHGB #xchgwXCHGW  xchgwXCHGW  xchgwXCHGW  xchgwXCHGW $xchglXCHGLxchglXCHGLxchglXCHGLxchglXCHGL'xchgbXCHGB# xchgwXCHGW$ xchglXCHGL'&https://www.felixcloutier.com/x86/xchgCMOVAECMOVAEcmovae Move if above or equal (CF == 0)cmovaew  cmovaew $cmovaelcmovael'MOVSLDUPMOVSLDUPmovsldup'Move Packed Single-FP Low and Duplicatemovsldupmovsldup/*https://www.felixcloutier.com/x86/movsldup VCVTPD2PS VCVTPD2PS vcvtpd2psNConvert Packed Double-Precision FP Values to Packed Single-Precision FP Values vcvtpd2psx=H vcvtpd2psy?H vcvtpd2psAH vcvtpd2psxH vcvtpd2psyH vcvtpd2psH vcvtpd2psx=H vcvtpd2psy?H vcvtpd2psx  vcvtpd2psxH vcvtpd2psy  vcvtpd2psyH vcvtpd2psx/  vcvtpd2psy2  vcvtpd2psAH vcvtpd2psH vcvtpd2psQH vcvtpd2psQH VEXTRACTF64X2 VEXTRACTF64X2 vextractf64x2AExtract 128 Bits of Packed Double-Precision Floating-Point Values vextractf64x2J vextractf64x20J vextractf64x2J vextractf64x20J vextractf64x2J vextractf64x2J vextractf64x2/J vextractf64x2/JVPRORDVPRORDvprordRotate Packed Doubleword Right vprord9Hvprord:Hvprord;HvprordHvprordHvprordHvprord9HvprordHvprord:HvprordHvprord;HvprordH?https://www.felixcloutier.com/x86/vprord:vprorvd:vprorq:vprorvq VPBLENDMQ VPBLENDMQ vpblendmq.Blend Quadword Vectors Using an OpMask Control  vpblendmq=H vpblendmqH vpblendmq?H vpblendmqH vpblendmqAH vpblendmqH vpblendmq=H vpblendmqH vpblendmq?H vpblendmqH vpblendmqAH vpblendmqH5https://www.felixcloutier.com/x86/vpblendmd:vpblendmqVMINPSVMINPSvminps<Return Minimum Packed Single-Precision Floating-Point Valuesvminps9HvminpsHvminps:HvminpsHvminps;HvminpsHvminps9Hvminps vminpsHvminps/ vminps:Hvminps vminpsHvminps2 vminps;HvminpsHvminpsRHvminpsRHVPCMPGTBVPCMPGTBvpcmpgtb4Compare Packed Signed Byte Integers for Greater ThanvpcmpgtbIvpcmpgtbIvpcmpgtb/Ivpcmpgtb/IvpcmpgtbIvpcmpgtbIvpcmpgtb2Ivpcmpgtb2IvpcmpgtbIvpcmpgtbIvpcmpgtb5Ivpcmpgtb5Ivpcmpgtb vpcmpgtb/ vpcmpgtb!vpcmpgtb2!MOVLPDMOVLPDmovlpd5Move Low Packed Double-Precision Floating-Point ValuemovlpdMOVLPD+movlpdMOVLPD+(https://www.felixcloutier.com/x86/movlpdSUBSDSUBSDsubsd6Subtract Scalar Double-Precision Floating-Point ValuessubsdSUBSDsubsdSUBSD+'https://www.felixcloutier.com/x86/subsdVAESDECVAESDECvaesdec+Perform One Round of an AES Decryption Flow vaesdec vaesdecKvaesdec/ vaesdec/KvaesdecvaesdecKvaesdec2vaesdec2KvaesdecHvaesdec5HVFMADDSUB231PSVFMADDSUB231PSvfmaddsub231psXFused Multiply-Alternating Add/Subtract of Packed Single-Precision Floating-Point Valuesvfmaddsub231ps9Hvfmaddsub231psHvfmaddsub231ps:Hvfmaddsub231psHvfmaddsub231ps;Hvfmaddsub231psHvfmaddsub231ps9Hvfmaddsub231ps#vfmaddsub231psHvfmaddsub231ps/#vfmaddsub231ps:Hvfmaddsub231ps#vfmaddsub231psHvfmaddsub231ps2#vfmaddsub231ps;Hvfmaddsub231psHvfmaddsub231psQHvfmaddsub231psQHNhttps://www.felixcloutier.com/x86/vfmaddsub132ps:vfmaddsub213ps:vfmaddsub231ps VPMACSDQL VPMACSDQL vpmacsdqlCPacked Multiply Accumulate Signed Low Doubleword to Signed Quadword vpmacsdql" vpmacsdql/"KXORBKXORBkxorbBitwise Logical XOR 8-bit MaskskxorbJ9https://www.felixcloutier.com/x86/kxorw:kxorb:kxorq:kxordVMULPDVMULPDvmulpd6Multiply Packed Double-Precision Floating-Point Valuesvmulpd=HvmulpdHvmulpd?HvmulpdHvmulpdAHvmulpdHvmulpd=Hvmulpd vmulpdHvmulpd/ vmulpd?Hvmulpd vmulpdHvmulpd2 vmulpdAHvmulpdHvmulpdQHvmulpdQH VCVTSH2SS VCVTSH2SS vcvtsh2ssJConvert Scalar Half-Precision FP Value to Scalar Double-Precision FP Value vcvtsh2ssR vcvtsh2ss$R vcvtsh2ssR vcvtsh2ss$R vcvtsh2ssRR vcvtsh2ssRR+https://www.felixcloutier.com/x86/vcvtsh2ss VPDPWSUDS VPDPWSUDS vpdpwsudsXPacked Dot Product of Signed-by-Unsigned Word subvectors into Doubleword with Saturation vpdpwsudsY vpdpwsuds/Y vpdpwsudsY vpdpwsuds2YVPHADDDQVPHADDDQvphadddq:Packed Horizontal Add Signed Doubleword to Signed Quadwordvphadddq"vphadddq/"PEXTRDPEXTRDpextrdExtract Doublewordpextrdpextrd'6https://www.felixcloutier.com/x86/pextrb:pextrd:pextrq VFIXUPIMMSD VFIXUPIMMSD vfixupimmsd;Fix Up Special Scalar Double-Precision Floating-Point Value vfixupimmsdH vfixupimmsd+H vfixupimmsdH vfixupimmsd+H vfixupimmsdRH vfixupimmsdRH-https://www.felixcloutier.com/x86/vfixupimmsdVPMOVM2DVPMOVM2Dvpmovm2d:Expand Bits of Mask Register to Packed Doubleword Integersvpmovm2dJvpmovm2dJvpmovm2dJEhttps://www.felixcloutier.com/x86/vpmovm2b:vpmovm2w:vpmovm2d:vpmovm2qCMOVNLECMOVNLEcmovnle0Move if not less or equal (ZF == 0 and SF == OF)cmovnlew  cmovnlew $cmovnlelcmovnlel'CVTPS2DQCVTPS2DQcvtps2dqBConvert Packed Single-Precision FP Values to Packed Dword Integerscvtps2dqcvtps2dq/*https://www.felixcloutier.com/x86/cvtps2dqCPUIDCPUIDcpuidCPU IdentificationcpuidCPUID'https://www.felixcloutier.com/x86/cpuidCMOVNBCMOVNBcmovnbMove if not below (CF == 0)cmovnbw  cmovnbw $cmovnblcmovnbl'MPSADBWMPSADBWmpsadbw3Compute Multiple Packed Sums of Absolute Differencempsadbwmpsadbw/)https://www.felixcloutier.com/x86/mpsadbwUMONITORUMONITORumonitor(User mode Monitor a Linear Address RangeumonitorG*https://www.felixcloutier.com/x86/umonitorSHRSHRshrLogical Shift RightshrbSHRB shrbSHRB shrbSHRB shrwSHRW shrwSHRW shrwSHRW shrlSHRLshrlSHRLshrlSHRLshrbSHRB#shrbSHRB#shrbSHRB#shrwSHRW$shrwSHRW$shrwSHRW$shrlSHRL'shrlSHRL'shrlSHRL'1https://www.felixcloutier.com/x86/sal:sar:shl:shr VPMASKMOVQ VPMASKMOVQ vpmaskmovq)Conditional Move Packed Quadword Integers vpmaskmovq/! vpmaskmovq2! vpmaskmovq/! vpmaskmovq2! VSCATTERDPS VSCATTERDPS vscatterdpsTScatter Packed Single-Precision Floating-Point Values with Signed Doubleword Indices vscatterdpsCH vscatterdpsGH vscatterdpsKHQhttps://www.felixcloutier.com/x86/vscatterdps:vscatterdpd:vscatterqps:vscatterqpdANDPSANDPSandpsDBitwise Logical AND of Packed Single-Precision Floating-Point ValuesandpsANDPSandpsANDPS/'https://www.felixcloutier.com/x86/andpsUCOMISSUCOMISSucomissNUnordered Compare Scalar Single-Precision Floating-Point Values and Set EFLAGSucomissUCOMISSucomissUCOMISS')https://www.felixcloutier.com/x86/ucomissVPDPWUSDVPDPWUSDvpdpwusdHPacked Dot Product of Unsigned-by-Signed Word subvectors into DoublewordvpdpwusdYvpdpwusd/YvpdpwusdYvpdpwusd2YVPSHADVPSHADvpshad#Packed Shift Arithmetic Doublewordsvpshad"vpshad/"vpshad/"VPDPBSSDVPDPBSSDvpdpbssdFPacked Dot Product of Signed-by-Singed Byte subvectors into DoublewordvpdpbssdXvpdpbssd/XvpdpbssdXvpdpbssd2XMOVDMOVDmovdMove Doublewordmovd movdmovd movd' movdmovd'movd' movd'+https://www.felixcloutier.com/x86/movd:movqSETPESETPEsetpe!Set byte if parity even (PF == 1)setpeSETPS setpeSETPS#VORPDVORPDvorpd<Bitwise Logical OR of Double-Precision Floating-Point Valuesvorpd=JvorpdJvorpd?JvorpdJvorpdAJvorpdJvorpd=Jvorpd vorpdJvorpd/ vorpd?Jvorpd vorpdJvorpd2 vorpdAJvorpdJCMOVNACMOVNAcmovna&Move if not above (CF == 1 or ZF == 1)cmovnaw  cmovnaw $cmovnalcmovnal'PI2FWPI2FWpi2fw0Packed Integer to Floating-Point Word Conversionpi2fwpi2fw+VEXP2PDVEXP2PDvexp2pdyApproximation to the Exponential 2^x of Packed Double-Precision Floating-Point Values with Less Than 2^-23 Relative Errorvexp2pdAMvexp2pdMvexp2pdAMvexp2pdMvexp2pdRMvexp2pdRM)https://www.felixcloutier.com/x86/vexp2pd VFMADD132SD VFMADD132SD vfmadd132sdCFused Multiply-Add of Scalar Double-Precision Floating-Point Values vfmadd132sdH vfmadd132sd+H vfmadd132sd# vfmadd132sdH vfmadd132sd+# vfmadd132sd+H vfmadd132sdQH vfmadd132sdQHEhttps://www.felixcloutier.com/x86/vfmadd132sd:vfmadd213sd:vfmadd231sdVPERMPDVPERMPDvpermpd0Permute Double-Precision Floating-Point Elementsvpermpd?HvpermpdAHvpermpd?HvpermpdHvpermpdHvpermpdAHvpermpdHvpermpdHvpermpd?Hvpermpd?Hvpermpd!vpermpdHvpermpdHvpermpd2!vpermpdAHvpermpdAHvpermpdHvpermpdH)https://www.felixcloutier.com/x86/vpermpdVMULSHVMULSHvmulsh:Fused Multiply Scalar Half-Precision Floating-Point ValuesvmulshRvmulsh$RvmulshRvmulsh$RvmulshQRvmulshQR(https://www.felixcloutier.com/x86/vmulsh VGATHERPF1DPD VGATHERPF1DPD vgatherpf1dpdoSparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Doubleword Indices Using T1 Hint vgatherpf1dpdGLYhttps://www.felixcloutier.com/x86/vgatherpf1dps:vgatherpf1qps:vgatherpf1dpd:vgatherpf1qpd VPCOMPRESSW VPCOMPRESSW vpcompresswBStore Sparse Packed Word Integer Values into Dense Memory/Register  vpcompressw0K vpcompresswK vpcompressw3K vpcompresswK vpcompressw6U vpcompresswU vpcompresswK vpcompresswK vpcompresswU vpcompressw/K vpcompressw2K vpcompressw5UVBCSTNEBF162PSVBCSTNEBF162PSvbcstnebf162ps;Load BF16 Element and Convert to FP32 Element With Broadcasvbcstnebf162ps$Zvbcstnebf162ps$Z PUNPCKLDQ PUNPCKLDQ punpckldq:Unpack and Interleave Low-Order Doublewords into Quadwords punpckldq  punpckldq'  punpckldq punpckldq/Jhttps://www.felixcloutier.com/x86/punpcklbw:punpcklwd:punpckldq:punpcklqdqPHADDSWPHADDSWphaddswAPacked Horizontal Add Signed Word Integers with Signed Saturationphaddswphaddsw+phaddswphaddsw/)https://www.felixcloutier.com/x86/phaddswCMOVNGCMOVNGcmovng)Move if not greater (ZF == 1 or SF != OF)cmovngw  cmovngw $cmovnglcmovngl'PAUSEPAUSEpauseSpin Loop HintpausePAUSE'https://www.felixcloutier.com/x86/pauseVPCMPGTWVPCMPGTWvpcmpgtw4Compare Packed Signed Word Integers for Greater ThanvpcmpgtwIvpcmpgtwIvpcmpgtw/Ivpcmpgtw/IvpcmpgtwIvpcmpgtwIvpcmpgtw2Ivpcmpgtw2IvpcmpgtwIvpcmpgtwIvpcmpgtw5Ivpcmpgtw5Ivpcmpgtw vpcmpgtw/ vpcmpgtw!vpcmpgtw2! VPEXPANDW VPEXPANDW vpexpandwALoad Sparse Packed Word Integer Values from Dense Memory/Register  vpexpandwK vpexpandwK vpexpandwU vpexpandw/K vpexpandw2K vpexpandw5U vpexpandwK vpexpandw/K vpexpandwK vpexpandw2K vpexpandwU vpexpandw5U5https://www.felixcloutier.com/x86/vpexpandb:vpexpandw VCVTSH2SD VCVTSH2SD vcvtsh2sdJConvert Scalar Half-Precision FP Value to Scalar Double-Precision FP Value vcvtsh2sdR vcvtsh2sd$R vcvtsh2sdR vcvtsh2sd$R vcvtsh2sdRR vcvtsh2sdRR+https://www.felixcloutier.com/x86/vcvtsh2sd VGETEXPPS VGETEXPPS vgetexppslExtract Exponents of Packed Single-Precision Floating-Point Values as Single-Precision Floating-Point Values vgetexpps9H vgetexpps:H vgetexpps;H vgetexppsH vgetexppsH vgetexppsH vgetexpps9H vgetexppsH vgetexpps:H vgetexppsH vgetexpps;H vgetexppsH vgetexppsRH vgetexppsRH+https://www.felixcloutier.com/x86/vgetexppsVPMOVQBVPMOVQBvpmovqbBDown Convert Packed Quadword Values to Byte Values with Truncation vpmovqbHvpmovqb%HvpmovqbHvpmovqb(HvpmovqbHvpmovqb,HvpmovqbHvpmovqbHvpmovqbHvpmovqb$Hvpmovqb'Hvpmovqb+H<https://www.felixcloutier.com/x86/vpmovqb:vpmovsqb:vpmovusqb PREFETCHWT1 PREFETCHWT1 prefetchwt1APrefetch Vector Data Into Caches with Intent to Write and T1 Hint prefetchwt1#C-https://www.felixcloutier.com/x86/prefetchwt1 VCVTTPH2UDQ VCVTTPH2UDQ vcvttph2udqpConvert with Truncation Packed Half-Precision Floating-Point Values to Packed Unsigned Doubleword Integer Values vcvttph2udq.K vcvttph2udq<K vcvttph2udq>R vcvttph2udqK vcvttph2udqK vcvttph2udqR vcvttph2udq.K vcvttph2udqK vcvttph2udq<K vcvttph2udqK vcvttph2udq>R vcvttph2udqR vcvttph2udqRR vcvttph2udqRR-https://www.felixcloutier.com/x86/vcvttph2udq VCVTUSI2SS VCVTUSI2SS vcvtusi2ssHConvert Unsigned Integer to Scalar Single-Precision Floating-Point Value vcvtusi2sslH vcvtusi2ssl'H vcvtusi2sslQH,https://www.felixcloutier.com/x86/vcvtusi2ssPSLLQPSLLQpsllq'Shift Packed Quadword Data Left Logicalpsllq psllq psllq+ psllqpsllqpsllq/3https://www.felixcloutier.com/x86/psllw:pslld:psllqVFMSUBADD231PHVFMSUBADD231PHvfmsubadd231phVFused Multiply-Alternating Subtract/Add of Packed Half-Precision Floating-Point Valuesvfmsubadd231ph<Kvfmsubadd231phKvfmsubadd231ph>Kvfmsubadd231phKvfmsubadd231ph@Rvfmsubadd231phRvfmsubadd231ph<Kvfmsubadd231phKvfmsubadd231ph>Kvfmsubadd231phKvfmsubadd231ph@Rvfmsubadd231phRvfmsubadd231phQRvfmsubadd231phQRNhttps://www.felixcloutier.com/x86/vfmsubadd132ph:vfmsubadd213ph:vfmsubadd231phMAXPSMAXPSmaxps<Return Maximum Packed Single-Precision Floating-Point ValuesmaxpsMAXPSmaxpsMAXPS/'https://www.felixcloutier.com/x86/maxpsSETPSETPsetpSet byte if parity (PF == 1)setpSETPS setpSETPS#PMOVZXBQPMOVZXBQpmovzxbqBMove Packed Byte Integers to Quadword Integers with Zero Extensionpmovzxbqpmovzxbq$SQRTPDSQRTPDsqrtpdECompute Square Roots of Packed Double-Precision Floating-Point ValuessqrtpdSQRTPDsqrtpdSQRTPD/(https://www.felixcloutier.com/x86/sqrtpdJNGJNGjng)Jump if not greater (ZF == 1 or SF != OF)jngJLENjngJLEOVANDPSVANDPSvandpsDBitwise Logical AND of Packed Single-Precision Floating-Point Valuesvandps9JvandpsJvandps:JvandpsJvandps;JvandpsJvandps9Jvandps vandpsJvandps/ vandps:Jvandps vandpsJvandps2 vandps;JvandpsJ VPACKSSWB VPACKSSWB vpacksswb,Pack Words into Bytes with Signed Saturation vpacksswbI vpacksswb/I vpacksswbI vpacksswb2I vpacksswbI vpacksswb5I vpacksswb  vpacksswbI vpacksswb/  vpacksswb/I vpacksswb! vpacksswbI vpacksswb2! vpacksswb2I vpacksswbI vpacksswb5IPMINSBPMINSBpminsb&Minimum of Packed Signed Byte Integerspminsbpminsb//https://www.felixcloutier.com/x86/pminsb:pminswPFPNACCPFPNACCpfpnacc2Packed Floating-Point Positive-Negative Accumulatepfpnaccpfpnacc+BTRBTRbtrBit Test and ResetbtrwBTRW btrwBTRW  btrlBTRLbtrlBTRLbtrwBTRW$btrwBTRW$ btrlBTRL'btrlBTRL'%https://www.felixcloutier.com/x86/btr VRNDSCALEPH VRNDSCALEPH vrndscaleph\Round Packed Half-Precision Floating-Point Values To Include A Given Number Of Fraction Bits vrndscaleph<K vrndscaleph>K vrndscaleph@R vrndscalephK vrndscalephK vrndscalephR vrndscaleph<K vrndscalephK vrndscaleph>K vrndscalephK vrndscaleph@R vrndscalephR vrndscalephRR vrndscalephRR-https://www.felixcloutier.com/x86/vrndscalephPSLLWPSLLWpsllw#Shift Packed Word Data Left Logicalpsllw psllw psllw+ psllwpsllwpsllw/3https://www.felixcloutier.com/x86/psllw:pslld:psllqMASKMOVQMASKMOVQmaskmovq Store Selected Bytes of Quadwordmaskmovq *https://www.felixcloutier.com/x86/maskmovq VEXPANDPS VEXPANDPS vexpandpsKLoad Sparse Packed Single-Precision Floating-Point Values from Dense Memory  vexpandpsH vexpandpsH vexpandpsH vexpandps/H vexpandps2H vexpandps5H vexpandpsH vexpandps/H vexpandpsH vexpandps2H vexpandpsH vexpandps5H+https://www.felixcloutier.com/x86/vexpandpsPI2FDPI2FDpi2fd6Packed Integer to Floating-Point Doubleword Conversionpi2fdpi2fd+ VCVTSD2USI VCVTSD2USI vcvtsd2usiSConvert Scalar Double-Precision Floating-Point Value to Unsigned Doubleword Integer vcvtsd2usiH vcvtsd2usi+H vcvtsd2usiQH,https://www.felixcloutier.com/x86/vcvtsd2usi VFMADD213SS VFMADD213SS vfmadd213ssCFused Multiply-Add of Scalar Single-Precision Floating-Point Values vfmadd213ssH vfmadd213ss'H vfmadd213ss# vfmadd213ssH vfmadd213ss'# vfmadd213ss'H vfmadd213ssQH vfmadd213ssQHEhttps://www.felixcloutier.com/x86/vfmadd132ss:vfmadd213ss:vfmadd231ssVSM4KEY4VSM4KEY4vsm4key4(Perform Four Rounds of SM4 Key Expansionvsm4key4vsm4key4/vsm4key4vsm4key42 VPMADD52HUQ VPMADD52HUQ vpmadd52huqjPacked Multiply of Unsigned 52-bit Unsigned Integers and Add High 52-bit Products to Quadword Accumulators vpmadd52huq=K vpmadd52huqK vpmadd52huq?K vpmadd52huqK vpmadd52huqAO vpmadd52huqO vpmadd52huq=K vpmadd52huqK vpmadd52huq[ vpmadd52huq/[ vpmadd52huq?K vpmadd52huqK vpmadd52huq[ vpmadd52huq2[ vpmadd52huqAO vpmadd52huqO-https://www.felixcloutier.com/x86/vpmadd52huq PUNPCKHQDQ PUNPCKHQDQ punpckhqdq@Unpack and Interleave High-Order Quadwords into Double Quadwords punpckhqdq PUNPCKHQDQ punpckhqdq PUNPCKHQDQ/Jhttps://www.felixcloutier.com/x86/punpckhbw:punpckhwd:punpckhdq:punpckhqdqVPCMOVVPCMOVvpcmovPacked Conditional Movevpcmov"vpcmov/"vpcmov/"vpcmov"vpcmov2"vpcmov2"VCOMISHVCOMISHvcomishJCompare Scalar Ordered Half-Precision Floating-Point Values and Set EFLAGSvcomishRvcomish$RvcomishRR)https://www.felixcloutier.com/x86/vcomishMULPSMULPSmulps6Multiply Packed Single-Precision Floating-Point ValuesmulpsMULPSmulpsMULPS/'https://www.felixcloutier.com/x86/mulpsVPSRLVWVPSRLVWvpsrlvw-Variable Shift Packed Word Data Right Logical vpsrlvwIvpsrlvw/IvpsrlvwIvpsrlvw2IvpsrlvwIvpsrlvw5IvpsrlvwIvpsrlvw/IvpsrlvwIvpsrlvw2IvpsrlvwIvpsrlvw5I9https://www.felixcloutier.com/x86/vpsrlvw:vpsrlvd:vpsrlvq VFNMSUBPD VFNMSUBPD vfnmsubpdQFused Negative Multiply-Subtract of Packed Double-Precision Floating-Point Values vfnmsubpd$ vfnmsubpd/$ vfnmsubpd/$ vfnmsubpd$ vfnmsubpd2$ vfnmsubpd2$VPSLLQVPSLLQvpsllq'Shift Packed Quadword Data Left Logicalvpsllq=Hvpsllq?HvpsllqAHvpsllqHvpsllqHvpsllq/HvpsllqHvpsllqHvpsllq/HvpsllqHvpsllqHvpsllq/Hvpsllq=Hvpsllq vpsllqHvpsllq vpsllqHvpsllq/ vpsllq/Hvpsllq?Hvpsllq!vpsllqHvpsllq!vpsllqHvpsllq/!vpsllq/HvpsllqAHvpsllqHvpsllqHvpsllq/H VPMADD52LUQ VPMADD52LUQ vpmadd52luqdPacked Multiply of Unsigned 52-bit Integers and Add the Low 52-bit Products to Quadword Accumulators vpmadd52luq=K vpmadd52luqK vpmadd52luq?K vpmadd52luqK vpmadd52luqAO vpmadd52luqO vpmadd52luq=K vpmadd52luqK vpmadd52luq[ vpmadd52luq/[ vpmadd52luq?K vpmadd52luqK vpmadd52luq[ vpmadd52luq2[ vpmadd52luqAO vpmadd52luqO-https://www.felixcloutier.com/x86/vpmadd52luqVPHADDBWVPHADDBWvphaddbw0Packed Horizontal Add Signed Byte to Signed Wordvphaddbw"vphaddbw/"VPMADDWDVPMADDWDvpmaddwd,Multiply and Add Packed Signed Word IntegersvpmaddwdIvpmaddwd/IvpmaddwdIvpmaddwd2IvpmaddwdIvpmaddwd5Ivpmaddwd vpmaddwdIvpmaddwd/ vpmaddwd/Ivpmaddwd!vpmaddwdIvpmaddwd2!vpmaddwd2IvpmaddwdIvpmaddwd5IVPSIGNBVPSIGNBvpsignbPacked Sign of Byte Integersvpsignb vpsignb/ vpsignb!vpsignb2!PORPORporPacked Bitwise Logical ORpor por+ porpor/%https://www.felixcloutier.com/x86/por VCVTSH2USI VCVTSH2USI vcvtsh2usiQConvert Scalar Half-Precision Floating-Point Value to Unsigned Doubleword Integer vcvtsh2usiR vcvtsh2usi$R vcvtsh2usiQR,https://www.felixcloutier.com/x86/vcvtsh2usiUNPCKLPSUNPCKLPSunpcklpsGUnpack and Interleave Low Packed Single-Precision Floating-Point ValuesunpcklpsUNPCKLPSunpcklpsUNPCKLPS/*https://www.felixcloutier.com/x86/unpcklpsVPDPBUUDVPDPBUUDvpdpbuudJPacked Dot Product of Unsigned-by-Unsinged Byte subvectors into DoublewordvpdpbuudXvpdpbuud/XvpdpbuudXvpdpbuud2XVHSUBPSVHSUBPSvhsubps$Packed Single-FP Horizontal Subtractvhsubps vhsubps/ vhsubps vhsubps2  VFMSUB213PH VFMSUB213PH vfmsub213phFFused Multiply-Subtract of Packed Half-Precision Floating-Point Values vfmsub213ph<K vfmsub213phK vfmsub213ph>K vfmsub213phK vfmsub213ph@R vfmsub213phR vfmsub213ph<K vfmsub213phK vfmsub213ph>K vfmsub213phK vfmsub213ph@R vfmsub213phR vfmsub213phQR vfmsub213phQRlhttps://www.felixcloutier.com/x86/vfmsub132ph:vfnmsub132ph:vfmsub213ph:vfnmsub213ph:vfmsub231ph:vfnmsub231phPDEPPDEPpdepParallel Bits Depositpdepl5pdepl'5&https://www.felixcloutier.com/x86/pdep VFNMADD132PD VFNMADD132PD vfnmadd132pdLFused Negative Multiply-Add of Packed Double-Precision Floating-Point Values vfnmadd132pd=H vfnmadd132pdH vfnmadd132pd?H vfnmadd132pdH vfnmadd132pdAH vfnmadd132pdH vfnmadd132pd=H vfnmadd132pd# vfnmadd132pdH vfnmadd132pd/# vfnmadd132pd?H vfnmadd132pd# vfnmadd132pdH vfnmadd132pd2# vfnmadd132pdAH vfnmadd132pdH vfnmadd132pdQH vfnmadd132pdQHHhttps://www.felixcloutier.com/x86/vfnmadd132pd:vfnmadd213pd:vfnmadd231pd VCOMPRESSPD VCOMPRESSPD vcompresspdUStore Sparse Packed Double-Precision Floating-Point Values into Dense Memory/Register  vcompresspdH vcompresspd0H vcompresspdH vcompresspd3H vcompresspdH vcompresspd6H vcompresspdH vcompresspdH vcompresspdH vcompresspd/H vcompresspd2H vcompresspd5H-https://www.felixcloutier.com/x86/vcompresspdKTESTWKTESTWktestw#Bit Test 16-bit Masks and Set FlagsktestwJ=https://www.felixcloutier.com/x86/ktestw:ktestb:ktestq:ktestd VPMOVZXWQ VPMOVZXWQ vpmovzxwqBMove Packed Word Integers to Quadword Integers with Zero Extension vpmovzxwqH vpmovzxwqH vpmovzxwqH vpmovzxwq'H vpmovzxwq+H vpmovzxwq/H vpmovzxwq  vpmovzxwqH vpmovzxwq'  vpmovzxwq'H vpmovzxwq! vpmovzxwqH vpmovzxwq+! vpmovzxwq+H vpmovzxwqH vpmovzxwq/HCMPCMPcmpCompare Two OperandscmpbCMPBcmpbCMPB cmpbCMPB  cmpbCMPB #cmpwCMPW cmpwCMPW cmpwCMPW cmpwCMPW  cmpwCMPW $cmplCMPLcmplCMPLcmplCMPLcmplCMPLcmplCMPL'cmpbCMPB#cmpbCMPB# cmpwCMPW$cmpwCMPW$cmpwCMPW$ cmplCMPL'cmplCMPL'cmplCMPL'%https://www.felixcloutier.com/x86/cmpVPERMT2DVPERMT2Dvpermt2d?Full Permute of Doublewords From Two Tables Overwriting a Table vpermt2d9Hvpermt2dHvpermt2d:Hvpermt2dHvpermt2d;Hvpermt2dHvpermt2d9Hvpermt2dHvpermt2d:Hvpermt2dHvpermt2d;Hvpermt2dHPhttps://www.felixcloutier.com/x86/vpermt2w:vpermt2d:vpermt2q:vpermt2ps:vpermt2pdVPMACSWWVPMACSWWvpmacsww5Packed Multiply Accumulate Signed Word to Signed Wordvpmacsww"vpmacsww/" VPMOVSXDQ VPMOVSXDQ vpmovsxdqHMove Packed Doubleword Integers to Quadword Integers with Sign Extension vpmovsxdqH vpmovsxdqH vpmovsxdqH vpmovsxdq+H vpmovsxdq/H vpmovsxdq2H vpmovsxdq  vpmovsxdqH vpmovsxdq+  vpmovsxdq+H vpmovsxdq! vpmovsxdqH vpmovsxdq/! vpmovsxdq/H vpmovsxdqH vpmovsxdq2HSTMXCSRSTMXCSRstmxcsrStore MXCSR Register Statestmxcsr')https://www.felixcloutier.com/x86/stmxcsrRDTSCRDTSCrdtscRead Time-Stamp CounterrdtscRDTSC'https://www.felixcloutier.com/x86/rdtscDECDECdecDecrement by 1decbDECB decwDECW declDECLdecbDECB#decwDECW$declDECL'%https://www.felixcloutier.com/x86/decJLJLjlJump if less (SF != OF)jlJLTNjlJLTOMWAITMWAITmwait Monitor WaitmwaitD'https://www.felixcloutier.com/x86/mwaitPEXTRWPEXTRWpextrw Extract Wordpextrw pextrwpextrw$(https://www.felixcloutier.com/x86/pextrwVPCOMDVPCOMDvpcomd)Compare Packed Signed Doubleword Integersvpcomd"vpcomd/" VFNMSUB132SD VFNMSUB132SD vfnmsub132sdQFused Negative Multiply-Subtract of Scalar Double-Precision Floating-Point Values vfnmsub132sdH vfnmsub132sd+H vfnmsub132sd# vfnmsub132sdH vfnmsub132sd+# vfnmsub132sd+H vfnmsub132sdQH vfnmsub132sdQHHhttps://www.felixcloutier.com/x86/vfnmsub132sd:vfnmsub213sd:vfnmsub231sdJAEJAEjae Jump if above or equal (CF == 0)jaeJCCNjaeJCCOMOVNTIMOVNTImovnti(Store Doubleword Using Non-Temporal Hintmovntil'(https://www.felixcloutier.com/x86/movntiVPSIGNWVPSIGNWvpsignwPacked Sign of Word Integersvpsignw vpsignw/ vpsignw!vpsignw2! VCVTTPH2DQ VCVTTPH2DQ vcvttph2dqPConvert with Truncation Packed Half-Precision FP Values to Packed Dword Integers vcvttph2dq.K vcvttph2dq<K vcvttph2dq>R vcvttph2dqK vcvttph2dqK vcvttph2dqR vcvttph2dq.K vcvttph2dqK vcvttph2dq<K vcvttph2dqK vcvttph2dq>R vcvttph2dqR vcvttph2dqRR vcvttph2dqRR,https://www.felixcloutier.com/x86/vcvttph2dq VCVTPH2PSX VCVTPH2PSX vcvtph2psx>Convert Half-Precision FP Values to Single-Precision FP Values vcvtph2psx.K vcvtph2psx<K vcvtph2psx>R vcvtph2psxK vcvtph2psxK vcvtph2psxR vcvtph2psx.K vcvtph2psxK vcvtph2psx<K vcvtph2psxK vcvtph2psx>R vcvtph2psxR vcvtph2psxRR vcvtph2psxRR6https://www.felixcloutier.com/x86/vcvtph2ps:vcvtph2psxRDPMCRDPMCrdpmc#Read Performance-Monitoring Counterrdpmc-'https://www.felixcloutier.com/x86/rdpmc VFCMULCSH VFCMULCSH vfcmulcshOFused Conjugate Multiply of Complex Scalar Half-Precision Floating-Point Values vfcmulcshR vfcmulcsh'R vfcmulcshR vfcmulcsh'R vfcmulcshQR vfcmulcshQR4https://www.felixcloutier.com/x86/vfcmulcsh:vfmulcshPADDUSBPADDUSBpaddusb:Add Packed Unsigned Byte Integers with Unsigned Saturationpaddusb paddusb+ paddusbpaddusb/1https://www.felixcloutier.com/x86/paddusb:padduswPFRSQIT1PFRSQIT1pfrsqit18Packed Floating-Point Reciprocal Square Root Iteration 1pfrsqit1pfrsqit1+SHA1MSG1SHA1MSG1sha1msg1NPerform an Intermediate Calculation for the Next Four SHA1 Message Doublewordssha1msg1(sha1msg1/(*https://www.felixcloutier.com/x86/sha1msg1 VSCALEFPD VSCALEFPD vscalefpd_Scale Packed Double-Precision Floating-Point Values With Double-Precision Floating-Point Values vscalefpd=H vscalefpdH vscalefpd?H vscalefpdH vscalefpdAH vscalefpdH vscalefpd=H vscalefpdH vscalefpd?H vscalefpdH vscalefpdAH vscalefpdH vscalefpdQH vscalefpdQH+https://www.felixcloutier.com/x86/vscalefpdVPSHLDVWVPSHLDVWvpshldvw<Concatenate and Variable Shift Packed Word Data Left Logical vpshldvwKvpshldvw/KvpshldvwKvpshldvw2KvpshldvwUvpshldvw5UvpshldvwKvpshldvw/KvpshldvwKvpshldvw2KvpshldvwUvpshldvw5U VCVTTPH2W VCVTTPH2W vcvttph2waConvert with Truncation Packed Half-Precision Floating-Point Values to Packed Word Integer Values vcvttph2w<K vcvttph2w>K vcvttph2w@R vcvttph2wK vcvttph2wK vcvttph2wR vcvttph2w<K vcvttph2wK vcvttph2w>K vcvttph2wK vcvttph2w@R vcvttph2wR vcvttph2wRR vcvttph2wRR+https://www.felixcloutier.com/x86/vcvttph2wKMOVQKMOVQkmovqMove 64-bit MaskkmovqIkmovq+Ikmovq+I9https://www.felixcloutier.com/x86/kmovw:kmovb:kmovq:kmovdJNPJNPjnpJump if not parity (PF == 0)jnpJPCNjnpJPCO VPBLENDMD VPBLENDMD vpblendmd0Blend Doubleword Vectors Using an OpMask Control  vpblendmd9H vpblendmdH vpblendmd:H vpblendmdH vpblendmd;H vpblendmdH vpblendmd9H vpblendmdH vpblendmd:H vpblendmdH vpblendmd;H vpblendmdH5https://www.felixcloutier.com/x86/vpblendmd:vpblendmqCMOVECMOVEcmoveMove if equal (ZF == 1)cmovew  cmovew $cmovelcmovel'INCINCincIncrement by 1incbINCB incwINCW inclINCLincbINCB#incwINCW$inclINCL'%https://www.felixcloutier.com/x86/incJGEJGEjge#Jump if greater or equal (SF == OF)jgeJGENjgeJGEO VINSERTI32X8 VINSERTI32X8 vinserti32x83Insert 256 Bits of Packed Doubleword Integer Values vinserti32x8J vinserti32x82J vinserti32x8J vinserti32x82JVMOVHPDVMOVHPDvmovhpd6Move High Packed Double-Precision Floating-Point Valuevmovhpd+ vmovhpd+Hvmovhpd+ vmovhpd+HSETESETEseteSet byte if equal (ZF == 1)seteSETEQ seteSETEQ#MOVDIRIMOVDIRImovdiriMOVe to DIRect store Integermovdiri'0)https://www.felixcloutier.com/x86/movdiri VCVTQQ2PD VCVTQQ2PD vcvtqq2pdQConvert Packed Quadword Integers to Packed Double-Precision Floating-Point Values vcvtqq2pd=J vcvtqq2pd?J vcvtqq2pdAJ vcvtqq2pdJ vcvtqq2pdJ vcvtqq2pdJ vcvtqq2pd=J vcvtqq2pdJ vcvtqq2pd?J vcvtqq2pdJ vcvtqq2pdAJ vcvtqq2pdJ vcvtqq2pdQJ vcvtqq2pdQJ+https://www.felixcloutier.com/x86/vcvtqq2pdVPTESTVPTESTvptestPacked Logical Comparevptest vptest/ vptest vptest2 PCMPGTQPCMPGTQpcmpgtq$Compare Packed Data for Greater Thanpcmpgtqpcmpgtq/)https://www.felixcloutier.com/x86/pcmpgtqVPHADDWDVPHADDWDvphaddwd6Packed Horizontal Add Signed Word to Signed Doublewordvphaddwd"vphaddwd/" VPMOVUSQW VPMOVUSQW vpmovusqwKDown Convert Packed Quadword Values to Word Values with Unsigned Saturation  vpmovusqwH vpmovusqw(H vpmovusqwH vpmovusqw,H vpmovusqwH vpmovusqw0H vpmovusqwH vpmovusqwH vpmovusqwH vpmovusqw'H vpmovusqw+H vpmovusqw/H<https://www.felixcloutier.com/x86/vpmovqw:vpmovsqw:vpmovusqwKANDWKANDWkandw Bitwise Logical AND 16-bit MaskskandwH9https://www.felixcloutier.com/x86/kandw:kandb:kandq:kanddVBROADCASTF32X4VBROADCASTF32X4vbroadcastf32x47Broadcast Four Single-Precision Floating-Point Elementsvbroadcastf32x4/Hvbroadcastf32x4/Hvbroadcastf32x4/Hvbroadcastf32x4/HVPHADDSWVPHADDSWvphaddswAPacked Horizontal Add Signed Word Integers with Signed Saturationvphaddsw vphaddsw/ vphaddsw!vphaddsw2!DIVPSDIVPSdivps4Divide Packed Single-Precision Floating-Point ValuesdivpsDIVPSdivpsDIVPS/'https://www.felixcloutier.com/x86/divps VPMADCSWD VPMADCSWD vpmadcswd?Packed Multiply Add Accumulate Signed Word to Signed Doubleword vpmadcswd" vpmadcswd/" VCVTUQQ2PS VCVTUQQ2PS vcvtuqq2psZConvert Packed Unsigned Quadword Integers to Packed Single-Precision Floating-Point Values vcvtuqq2psx=J vcvtuqq2psy?J vcvtuqq2psAJ vcvtuqq2psxJ vcvtuqq2psyJ vcvtuqq2psJ vcvtuqq2psx=J vcvtuqq2psy?J vcvtuqq2psxJ vcvtuqq2psyJ vcvtuqq2psAJ vcvtuqq2psJ vcvtuqq2psQJ vcvtuqq2psQJ,https://www.felixcloutier.com/x86/vcvtuqq2psVPXORVPXORvpxor#Packed Bitwise Logical Exclusive ORvpxor vpxor/ vpxor!vpxor2!BLSIBLSIblsiIsolate Lowest Set Bitblsil4blsil'4&https://www.felixcloutier.com/x86/blsi VPHADDUWD VPHADDUWD vphadduwd1Packed Horizontal Add Unsigned Word to Doubleword vphadduwd" vphadduwd/" VREDUCESD VREDUCESD vreducesdRPerform Reduction Transformation on a Scalar Double-Precision Floating-Point Value vreducesdJ vreducesd+J vreducesdJ vreducesd+J+https://www.felixcloutier.com/x86/vreducesdVSCATTERPF0QPSVSCATTERPF0QPSvscatterpf0qps‚Sparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Quadword Indices Using T0 Hint with Intent to Writevscatterpf0qpsML]https://www.felixcloutier.com/x86/vscatterpf0dps:vscatterpf0qps:vscatterpf0dpd:vscatterpf0qpdCVTPD2PICVTPD2PIcvtpd2piBConvert Packed Double-Precision FP Values to Packed Dword Integerscvtpd2piCVTPD2PLcvtpd2piCVTPD2PL/*https://www.felixcloutier.com/x86/cvtpd2piCWDCWDcwdConvert Word to Doublewordcwtd-https://www.felixcloutier.com/x86/cwd:cdq:cqo EXTRACTPS EXTRACTPS extractps4Extract Packed Single Precision Floating-Point Value extractps extractps'+https://www.felixcloutier.com/x86/extractpsVPMULLDVPMULLDvpmulld?Multiply Packed Signed Doubleword Integers and Store Low Resultvpmulld9HvpmulldHvpmulld:HvpmulldHvpmulld;HvpmulldHvpmulld9Hvpmulld vpmulldHvpmulld/ vpmulld:Hvpmulld!vpmulldHvpmulld2!vpmulld;HvpmulldHSHUFPSSHUFPSshufps5Shuffle Packed Single-Precision Floating-Point Valuesshufpsshufps/(https://www.felixcloutier.com/x86/shufpsVSUBSSVSUBSSvsubss6Subtract Scalar Single-Precision Floating-Point ValuesvsubssHvsubss'Hvsubss vsubssHvsubss' vsubss'HvsubssQHvsubssQHVMAXSSVMAXSSvmaxss;Return Maximum Scalar Single-Precision Floating-Point ValuevmaxssHvmaxss'Hvmaxss vmaxssHvmaxss' vmaxss'HvmaxssRHvmaxssRHVPERMDVPERMDvpermdPermute Doubleword Integers vpermd:HvpermdHvpermd;HvpermdHvpermd:Hvpermd!vpermdHvpermd2!vpermd;HvpermdH/https://www.felixcloutier.com/x86/vpermd:vpermwCMOVPCMOVPcmovpMove if parity (PF == 1)cmovpw  cmovpw $cmovplcmovpl'INTOINTOinto#Interrupt 4 If Overflow Flag is SetintoINTO5https://www.felixcloutier.com/x86/intn:into:int3:int1 VFPCLASSPH VFPCLASSPH vfpclassph9Test Class of Packed Half-Precision Floating-Point Values  vfpclassphx<K vfpclassphx<K vfpclassphy>K vfpclassphy>K vfpclassphz@R vfpclassphz@R vfpclassphxK vfpclassphxK vfpclassphyK vfpclassphyK vfpclassphzR vfpclassphzR,https://www.felixcloutier.com/x86/vfpclassphCMOVZCMOVZcmovzMove if zero (ZF == 1)cmovzw  cmovzw $cmovzlcmovzl'VPBROADCASTMB2QVPBROADCASTMB2Qvpbroadcastmb2q=Broadcast Low Byte of Mask Register to Packed Quadword Valuesvpbroadcastmb2qNvpbroadcastmb2qNvpbroadcastmb2qNVPERMI2DVPERMI2Dvpermi2dAFull Permute of Doublewords From Two Tables Overwriting the Index vpermi2d9Hvpermi2dHvpermi2d:Hvpermi2dHvpermi2d;Hvpermi2dHvpermi2d9Hvpermi2dHvpermi2d:Hvpermi2dHvpermi2d;Hvpermi2dHPhttps://www.felixcloutier.com/x86/vpermi2w:vpermi2d:vpermi2q:vpermi2ps:vpermi2pdPMAXSWPMAXSWpmaxsw&Maximum of Packed Signed Word IntegerspmaxswPMAXSW pmaxswPMAXSW+ pmaxswPMAXSWpmaxswPMAXSW/=https://www.felixcloutier.com/x86/pmaxsb:pmaxsw:pmaxsd:pmaxsq VPEXPANDB VPEXPANDB vpexpandbALoad Sparse Packed Byte Integer Values from Dense Memory/Register  vpexpandbK vpexpandbK vpexpandbU vpexpandb/K vpexpandb2K vpexpandb5U vpexpandbK vpexpandb/K vpexpandbK vpexpandb2K vpexpandbU vpexpandb5U5https://www.felixcloutier.com/x86/vpexpandb:vpexpandwKADDBKADDBkaddbADD Two 8-bit MaskskaddbJ9https://www.felixcloutier.com/x86/kaddw:kaddb:kaddq:kaddd VREDUCEPS VREDUCEPS vreducepsQPerform Reduction Transformation on Packed Single-Precision Floating-Point Values  vreduceps9J vreduceps:J vreduceps;J vreducepsJ vreducepsJ vreducepsJ vreduceps9J vreducepsJ vreduceps:J vreducepsJ vreduceps;J vreducepsJ+https://www.felixcloutier.com/x86/vreducepsPUSHPUSHpushPush Value Onto the StackpushqpushqpushwPUSHW pushlPUSHLpushwPUSHW$pushlPUSHL'&https://www.felixcloutier.com/x86/pushCMOVCCMOVCcmovcMove if carry (CF == 1)cmovcw  cmovcw $cmovclcmovcl'MAXSDMAXSDmaxsd;Return Maximum Scalar Double-Precision Floating-Point ValuemaxsdMAXSDmaxsdMAXSD+'https://www.felixcloutier.com/x86/maxsdORPDORPDorpd<Bitwise Logical OR of Double-Precision Floating-Point ValuesorpdORPDorpdORPD/&https://www.felixcloutier.com/x86/orpdVFMULCSHVFMULCSHvfmulcshEFused Multiply of Complex Scalar Half-Precision Floating-Point ValuesvfmulcshRvfmulcsh'RvfmulcshRvfmulcsh'RvfmulcshQRvfmulcshQR4https://www.felixcloutier.com/x86/vfcmulcsh:vfmulcsh VFNMSUB213PS VFNMSUB213PS vfnmsub213psQFused Negative Multiply-Subtract of Packed Single-Precision Floating-Point Values vfnmsub213ps9H vfnmsub213psH vfnmsub213ps:H vfnmsub213psH vfnmsub213ps;H vfnmsub213psH vfnmsub213ps9H vfnmsub213ps# vfnmsub213psH vfnmsub213ps/# vfnmsub213ps:H vfnmsub213ps# vfnmsub213psH vfnmsub213ps2# vfnmsub213ps;H vfnmsub213psH vfnmsub213psQH vfnmsub213psQHHhttps://www.felixcloutier.com/x86/vfnmsub132ps:vfnmsub213ps:vfnmsub231ps VFMADD231SD VFMADD231SD vfmadd231sdCFused Multiply-Add of Scalar Double-Precision Floating-Point Values vfmadd231sdH vfmadd231sd+H vfmadd231sd# vfmadd231sdH vfmadd231sd+# vfmadd231sd+H vfmadd231sdQH vfmadd231sdQHEhttps://www.felixcloutier.com/x86/vfmadd132sd:vfmadd213sd:vfmadd231sdPMULHWPMULHWpmulhw:Multiply Packed Signed Word Integers and Store High Resultpmulhw pmulhw+ pmulhwpmulhw/(https://www.felixcloutier.com/x86/pmulhw VFPCLASSPS VFPCLASSPS vfpclassps;Test Class of Packed Single-Precision Floating-Point Values  vfpclasspsx9J vfpclasspsx9J vfpclasspsy:J vfpclasspsy:J vfpclasspsz;J vfpclasspsz;J vfpclasspsxJ vfpclasspsxJ vfpclasspsyJ vfpclasspsyJ vfpclasspszJ vfpclasspszJ,https://www.felixcloutier.com/x86/vfpclasspsVPANDQVPANDQvpandq/Bitwise Logical AND of Packed Quadword Integers vpandq=HvpandqHvpandq?HvpandqHvpandqAHvpandqHvpandq=HvpandqHvpandq?HvpandqHvpandqAHvpandqHSETLSETLsetlSet byte if less (SF != OF)setlSETLT setlSETLT# CVTTPD2DQ CVTTPD2DQ cvttpd2dqRConvert with Truncation Packed Double-Precision FP Values to Packed Dword Integers cvttpd2dq cvttpd2dq/+https://www.felixcloutier.com/x86/cvttpd2dqMAXSSMAXSSmaxss;Return Maximum Scalar Single-Precision Floating-Point ValuemaxssMAXSSmaxssMAXSS''https://www.felixcloutier.com/x86/maxssVCVTNE2PS2BF16VCVTNE2PS2BF16vcvtne2ps2bf16XConvert with Nearest-Even rounding 2 Single-Precision FP vectors into BFloat16 FP vector vcvtne2ps2bf169Kvcvtne2ps2bf16Kvcvtne2ps2bf16:Kvcvtne2ps2bf16Kvcvtne2ps2bf16;Qvcvtne2ps2bf16Qvcvtne2ps2bf169Kvcvtne2ps2bf16Kvcvtne2ps2bf16:Kvcvtne2ps2bf16Kvcvtne2ps2bf16;Qvcvtne2ps2bf16Q0https://www.felixcloutier.com/x86/vcvtne2ps2bf16VPERMI2QVPERMI2Qvpermi2q?Full Permute of Quadwords From Two Tables Overwriting the Index vpermi2q=Hvpermi2qHvpermi2q?Hvpermi2qHvpermi2qAHvpermi2qHvpermi2q=Hvpermi2qHvpermi2q?Hvpermi2qHvpermi2qAHvpermi2qHPhttps://www.felixcloutier.com/x86/vpermi2w:vpermi2d:vpermi2q:vpermi2ps:vpermi2pdVPSUBSWVPSUBSWvpsubsw;Subtract Packed Signed Word Integers with Signed SaturationvpsubswIvpsubsw/IvpsubswIvpsubsw2IvpsubswIvpsubsw5Ivpsubsw vpsubswIvpsubsw/ vpsubsw/Ivpsubsw!vpsubswIvpsubsw2!vpsubsw2IvpsubswIvpsubsw5IADDSSADDSSaddss1Add Scalar Single-Precision Floating-Point ValuesaddssADDSSaddssADDSS''https://www.felixcloutier.com/x86/addss VCVTNEPS2BF16 VCVTNEPS2BF16 vcvtneps2bf16YConvert with Nearest-Even rounding a Single-Precision FP vector into a BFloat16 FP vectorvcvtneps2bf16x9Kvcvtneps2bf16y:K vcvtneps2bf16;Qvcvtneps2bf16xKvcvtneps2bf16yK vcvtneps2bf16Qvcvtneps2bf16x9Kvcvtneps2bf16y:Kvcvtneps2bf16xKvcvtneps2bf16xZvcvtneps2bf16yKvcvtneps2bf16yZvcvtneps2bf16x/Zvcvtneps2bf16y2Z vcvtneps2bf16;Q vcvtneps2bf16Q/https://www.felixcloutier.com/x86/vcvtneps2bf16VSQRTSSVSQRTSSvsqrtssCCompute Square Root of Scalar Single-Precision Floating-Point ValuevsqrtssHvsqrtss'Hvsqrtss vsqrtssHvsqrtss' vsqrtss'HvsqrtssQHvsqrtssQHKSHIFTLQKSHIFTLQkshiftlqShift Left 64-bit MaskskshiftlqIEhttps://www.felixcloutier.com/x86/kshiftlw:kshiftlb:kshiftlq:kshiftldCVTDQ2PDCVTDQ2PDcvtdq2pdBConvert Packed Dword Integers to Packed Double-Precision FP Valuescvtdq2pdcvtdq2pd+*https://www.felixcloutier.com/x86/cvtdq2pd VCVTNEOPH2PS VCVTNEOPH2PS vcvtneoph2ps9Convert Odd Elements of Packed FP16 Values to FP32 Values vcvtneoph2ps/Z vcvtneoph2ps2Z VFNMSUB132SH VFNMSUB132SH vfnmsub132shOFused Negative Multiply-Subtract of Scalar Half-Precision Floating-Point Values vfnmsub132shR vfnmsub132sh$R vfnmsub132shR vfnmsub132sh$R vfnmsub132shQR vfnmsub132shQRlhttps://www.felixcloutier.com/x86/vfmsub132sh:vfnmsub132sh:vfmsub213sh:vfnmsub213sh:vfmsub231sh:vfnmsub231sh VPTERNLOGD VPTERNLOGD vpternlogd6Bitwise Ternary Logical Operation on Doubleword Values  vpternlogd9H vpternlogdH vpternlogd:H vpternlogdH vpternlogd;H vpternlogdH vpternlogd9H vpternlogdH vpternlogd:H vpternlogdH vpternlogd;H vpternlogdH7https://www.felixcloutier.com/x86/vpternlogd:vpternlogqDIVDIVdivUnsigned DividedivbDIVB divwDIVW divlDIVLdivbDIVB#divwDIVW$divlDIVL'%https://www.felixcloutier.com/x86/divJMPJMPjmpJump UnconditionallyjmpJMPNjmpJMPOjmplJMPjmplJMP'%https://www.felixcloutier.com/x86/jmpMINSSMINSSminss;Return Minimum Scalar Single-Precision Floating-Point ValueminssMINSSminssMINSS''https://www.felixcloutier.com/x86/minssVPHADDWQVPHADDWQvphaddwq4Packed Horizontal Add Signed Word to Signed Quadwordvphaddwq"vphaddwq/"VPINSRDVPINSRDvpinsrdInsert Doublewordvpinsrd vpinsrdJvpinsrd' vpinsrd'JVPMULDQVPMULDQvpmuldqDMultiply Packed Signed Doubleword Integers and Store Quadword Resultvpmuldq=HvpmuldqHvpmuldq?HvpmuldqHvpmuldqAHvpmuldqHvpmuldq=Hvpmuldq vpmuldqHvpmuldq/ vpmuldq?Hvpmuldq!vpmuldqHvpmuldq2!vpmuldqAHvpmuldqHPFRCPIT2PFRCPIT2pfrcpit2,Packed Floating-Point Reciprocal Iteration 2pfrcpit2pfrcpit2+VPSHLDVDVPSHLDVDvpshldvdBConcatenate and Variable Shift Packed Doubleword Data Left Logical vpshldvd9KvpshldvdKvpshldvd:KvpshldvdKvpshldvd;UvpshldvdUvpshldvd9KvpshldvdKvpshldvd:KvpshldvdKvpshldvd;UvpshldvdUCVTPD2PSCVTPD2PScvtpd2psNConvert Packed Double-Precision FP Values to Packed Single-Precision FP Valuescvtpd2psCVTPD2PScvtpd2psCVTPD2PS/*https://www.felixcloutier.com/x86/cvtpd2psVPMACSDDVPMACSDDvpmacsddAPacked Multiply Accumulate Signed Doubleword to Signed Doublewordvpmacsdd"vpmacsdd/"VFMADDSSVFMADDSSvfmaddssCFused Multiply-Add of Scalar Single-Precision Floating-Point Valuesvfmaddss$vfmaddss'$vfmaddss'$VPMOVSQBVPMOVSQBvpmovsqbIDown Convert Packed Quadword Values to Byte Values with Signed Saturation vpmovsqbHvpmovsqb%HvpmovsqbHvpmovsqb(HvpmovsqbHvpmovsqb,HvpmovsqbHvpmovsqbHvpmovsqbHvpmovsqb$Hvpmovsqb'Hvpmovsqb+H<https://www.felixcloutier.com/x86/vpmovqb:vpmovsqb:vpmovusqbADDPSADDPSaddps1Add Packed Single-Precision Floating-Point ValuesaddpsADDPSaddpsADDPS/'https://www.felixcloutier.com/x86/addps VRSQRT14PD VRSQRT14PD vrsqrt14pd`Compute Approximate Reciprocals of Square Roots of Packed Double-Precision Floating-Point Values  vrsqrt14pd=H vrsqrt14pd?H vrsqrt14pdAH vrsqrt14pdH vrsqrt14pdH vrsqrt14pdH vrsqrt14pd=H vrsqrt14pdH vrsqrt14pd?H vrsqrt14pdH vrsqrt14pdAH vrsqrt14pdH,https://www.felixcloutier.com/x86/vrsqrt14pdVFMADDSUB132PDVFMADDSUB132PDvfmaddsub132pdXFused Multiply-Alternating Add/Subtract of Packed Double-Precision Floating-Point Valuesvfmaddsub132pd=Hvfmaddsub132pdHvfmaddsub132pd?Hvfmaddsub132pdHvfmaddsub132pdAHvfmaddsub132pdHvfmaddsub132pd=Hvfmaddsub132pd#vfmaddsub132pdHvfmaddsub132pd/#vfmaddsub132pd?Hvfmaddsub132pd#vfmaddsub132pdHvfmaddsub132pd2#vfmaddsub132pdAHvfmaddsub132pdHvfmaddsub132pdQHvfmaddsub132pdQHNhttps://www.felixcloutier.com/x86/vfmaddsub132pd:vfmaddsub213pd:vfmaddsub231pdJEJEjeJump if equal (ZF == 1)jeJEQNjeJEQOPBLENDWPBLENDWpblendwBlend Packed Wordspblendwpblendw/)https://www.felixcloutier.com/x86/pblendwVCVTPH2WVCVTPH2Wvcvtph2wQConvert Packed Half-Precision Floating-Point Values to Packed Word Integer Valuesvcvtph2w<Kvcvtph2w>Kvcvtph2w@Rvcvtph2wKvcvtph2wKvcvtph2wRvcvtph2w<Kvcvtph2wKvcvtph2w>Kvcvtph2wKvcvtph2w@Rvcvtph2wRvcvtph2wQRvcvtph2wQR*https://www.felixcloutier.com/x86/vcvtph2w VFMSUB132SD VFMSUB132SD vfmsub132sdHFused Multiply-Subtract of Scalar Double-Precision Floating-Point Values vfmsub132sdH vfmsub132sd+H vfmsub132sd# vfmsub132sdH vfmsub132sd+# vfmsub132sd+H vfmsub132sdQH vfmsub132sdQHEhttps://www.felixcloutier.com/x86/vfmsub132sd:vfmsub213sd:vfmsub231sdJBEJBEjbe+Jump if below or equal (CF == 1 or ZF == 1)jbeJLSNjbeJLSOCLCCLCclcClear Carry FlagclcCLC%https://www.felixcloutier.com/x86/clc VPERMIL2PD VPERMIL2PD vpermil2pd:Permute Two-Source Double-Precision Floating-Point Vectors vpermil2pd" vpermil2pd/" vpermil2pd/" vpermil2pd" vpermil2pd2" vpermil2pd2"VPROTQVPROTQvprotqPacked Rotate Quadwordsvprotq"vprotq"vprotq/"vprotq/"vprotq/"STCSTCstcSet Carry FlagstcSTC%https://www.felixcloutier.com/x86/stc VPCOMPRESSD VPCOMPRESSD vpcompressdHStore Sparse Packed Doubleword Integer Values into Dense Memory/Register  vpcompressdH vpcompressd0H vpcompressdH vpcompressd3H vpcompressdH vpcompressd6H vpcompressdH vpcompressdH vpcompressdH vpcompressd/H vpcompressd2H vpcompressd5H-https://www.felixcloutier.com/x86/vpcompressdVSHUFPSVSHUFPSvshufps5Shuffle Packed Single-Precision Floating-Point Valuesvshufps9HvshufpsHvshufps:HvshufpsHvshufps;HvshufpsHvshufps9Hvshufps vshufpsHvshufps/ vshufps:Hvshufps vshufpsHvshufps2 vshufps;HvshufpsHVCVTW2PHVCVTW2PHvcvtw2phKConvert Packed Word Integers to Packed Half-Precision Floating-Point Valuesvcvtw2ph<Kvcvtw2ph>Kvcvtw2ph@Rvcvtw2phKvcvtw2phKvcvtw2phRvcvtw2ph<Kvcvtw2phKvcvtw2ph>Kvcvtw2phKvcvtw2ph@Rvcvtw2phRvcvtw2phQRvcvtw2phQR*https://www.felixcloutier.com/x86/vcvtw2ph VCVTUDQ2PH VCVTUDQ2PH vcvtudq2phZConvert Packed Unsigned Doubleword Integers to Packed Half-Precision Floating-Point Values vcvtudq2phx9K vcvtudq2phy:K vcvtudq2ph;R vcvtudq2phxK vcvtudq2phyK vcvtudq2phR vcvtudq2phx9K vcvtudq2phy:K vcvtudq2phxK vcvtudq2phyK vcvtudq2ph;R vcvtudq2phR vcvtudq2phQR vcvtudq2phQR,https://www.felixcloutier.com/x86/vcvtudq2phVDIVSDVDIVSDvdivsd4Divide Scalar Double-Precision Floating-Point ValuesvdivsdHvdivsd+Hvdivsd vdivsdHvdivsd+ vdivsd+HvdivsdQHvdivsdQHJNLJNLjnlJump if not less (SF == OF)jnlJGENjnlJGEOVANDNPDVANDNPDvandnpdHBitwise Logical AND NOT of Packed Double-Precision Floating-Point Valuesvandnpd=JvandnpdJvandnpd?JvandnpdJvandnpdAJvandnpdJvandnpd=Jvandnpd vandnpdJvandnpd/ vandnpd?Jvandnpd vandnpdJvandnpd2 vandnpdAJvandnpdJSAHFSAHFsahfStore AH into FlagssahfSAHF&https://www.felixcloutier.com/x86/sahfCMOVNBECMOVNBEcmovnbe0Move if not below or equal (CF == 0 and ZF == 0)cmovnbew  cmovnbew $cmovnbelcmovnbel'VFMSUBADD132PDVFMSUBADD132PDvfmsubadd132pdXFused Multiply-Alternating Subtract/Add of Packed Double-Precision Floating-Point Valuesvfmsubadd132pd=Hvfmsubadd132pdHvfmsubadd132pd?Hvfmsubadd132pdHvfmsubadd132pdAHvfmsubadd132pdHvfmsubadd132pd=Hvfmsubadd132pd#vfmsubadd132pdHvfmsubadd132pd/#vfmsubadd132pd?Hvfmsubadd132pd#vfmsubadd132pdHvfmsubadd132pd2#vfmsubadd132pdAHvfmsubadd132pdHvfmsubadd132pdQHvfmsubadd132pdQHNhttps://www.felixcloutier.com/x86/vfmsubadd132pd:vfmsubadd213pd:vfmsubadd231pd VFPCLASSSH VFPCLASSSH vfpclasssh8Test Class of Scalar Half-Precision Floating-Point Value vfpclassshR vfpclassshR vfpclasssh$R vfpclasssh$R,https://www.felixcloutier.com/x86/vfpclassshVPABSQVPABSQvpabsq*Packed Absolute Value of Quadword Integers vpabsq=Hvpabsq?HvpabsqAHvpabsqHvpabsqHvpabsqHvpabsq=HvpabsqHvpabsq?HvpabsqHvpabsqAHvpabsqHVRCP28PDVRCP28PDvrcp28pdtApproximation to the Reciprocal of Packed Double-Precision Floating-Point Values with Less Than 2^-28 Relative Errorvrcp28pdAMvrcp28pdMvrcp28pdAMvrcp28pdMvrcp28pdRMvrcp28pdRM*https://www.felixcloutier.com/x86/vrcp28pdVFMADDPDVFMADDPDvfmaddpdCFused Multiply-Add of Packed Double-Precision Floating-Point Valuesvfmaddpd$vfmaddpd/$vfmaddpd/$vfmaddpd$vfmaddpd2$vfmaddpd2$ VPMACSSWW VPMACSSWW vpmacsswwEPacked Multiply Accumulate with Saturation Signed Word to Signed Word vpmacssww" vpmacssww/"CLDEMOTECLDEMOTEcldemoteCache Line Demotecldemote#8*https://www.felixcloutier.com/x86/cldemoteCMOVNSCMOVNScmovnsMove if not sign (SF == 0)cmovnsw  cmovnsw $cmovnslcmovnsl'RDPIDRDPIDrdpidRead Processor IDrdpid,'https://www.felixcloutier.com/x86/rdpidMOVNTSDMOVNTSDmovntsdKStore Scalar Double-Precision Floating-Point Values Using Non-Temporal Hintmovntsd+MFENCEMFENCEmfence Memory FencemfenceMFENCE(https://www.felixcloutier.com/x86/mfenceLAHFLAHFlahfLoad AH from FlagslahfLAHF&https://www.felixcloutier.com/x86/lahfKXNORBKXNORBkxnorb Bitwise Logical XNOR 8-bit MaskskxnorbJ=https://www.felixcloutier.com/x86/kxnorw:kxnorb:kxnorq:kxnordRDPRURDPRUrdpru$Read Processor Register in User moderdpru.VPSUBUSBVPSUBUSBvpsubusb?Subtract Packed Unsigned Byte Integers with Unsigned SaturationvpsubusbIvpsubusb/IvpsubusbIvpsubusb2IvpsubusbIvpsubusb5Ivpsubusb vpsubusbIvpsubusb/ vpsubusb/Ivpsubusb!vpsubusbIvpsubusb2!vpsubusb2IvpsubusbIvpsubusb5IPHADDWPHADDWphaddw#Packed Horizontal Add Word Integersphaddwphaddw+phaddwphaddw//https://www.felixcloutier.com/x86/phaddw:phadddVPMOVB2MVPMOVB2Mvpmovb2m3Move Signs of Packed Byte Integers to Mask Registervpmovb2mIvpmovb2mIvpmovb2mIEhttps://www.felixcloutier.com/x86/vpmovb2m:vpmovw2m:vpmovd2m:vpmovq2mJZJZjzJump if zero (ZF == 1)jzJEQNjzJEQO VCVTNEEPH2PS VCVTNEEPH2PS vcvtneeph2ps:Convert Even Elements of Packed FP16 Values to FP32 Values vcvtneeph2ps/Z vcvtneeph2ps2ZKSHIFTLBKSHIFTLBkshiftlbShift Left 8-bit MaskskshiftlbJEhttps://www.felixcloutier.com/x86/kshiftlw:kshiftlb:kshiftlq:kshiftld VCVTTSS2SI VCVTTSS2SI vcvttss2siIConvert with Truncation Scalar Single-Precision FP Value to Dword Integer vcvttss2si  vcvttss2siH vcvttss2si'  vcvttss2si'H vcvttss2siRH GF2P8MULB GF2P8MULB gf2p8mulbGalois Field Multiply Bytes gf2p8mulb gf2p8mulb/+https://www.felixcloutier.com/x86/gf2p8mulbVPSRAQVPSRAQvpsraq+Shift Packed Quadword Data Right Arithmeticvpsraq=Hvpsraq?HvpsraqAHvpsraqHvpsraqHvpsraq/HvpsraqHvpsraqHvpsraq/HvpsraqHvpsraqHvpsraq/Hvpsraq=HvpsraqHvpsraqHvpsraq/Hvpsraq?HvpsraqHvpsraqHvpsraq/HvpsraqAHvpsraqHvpsraqHvpsraq/HVANDPDVANDPDvandpdDBitwise Logical AND of Packed Double-Precision Floating-Point Valuesvandpd=JvandpdJvandpd?JvandpdJvandpdAJvandpdJvandpd=Jvandpd vandpdJvandpd/ vandpd?Jvandpd vandpdJvandpd2 vandpdAJvandpdJPFSUBPFSUBpfsubPacked Floating-Point Subtractpfsubpfsub+VFMADDSUB213PSVFMADDSUB213PSvfmaddsub213psXFused Multiply-Alternating Add/Subtract of Packed Single-Precision Floating-Point Valuesvfmaddsub213ps9Hvfmaddsub213psHvfmaddsub213ps:Hvfmaddsub213psHvfmaddsub213ps;Hvfmaddsub213psHvfmaddsub213ps9Hvfmaddsub213ps#vfmaddsub213psHvfmaddsub213ps/#vfmaddsub213ps:Hvfmaddsub213ps#vfmaddsub213psHvfmaddsub213ps2#vfmaddsub213ps;Hvfmaddsub213psHvfmaddsub213psQHvfmaddsub213psQHNhttps://www.felixcloutier.com/x86/vfmaddsub132ps:vfmaddsub213ps:vfmaddsub231psVPCMPUWVPCMPUWvpcmpuw#Compare Packed Unsigned Word Values vpcmpuwIvpcmpuwIvpcmpuw/Ivpcmpuw/IvpcmpuwIvpcmpuwIvpcmpuw2Ivpcmpuw2IvpcmpuwIvpcmpuwIvpcmpuw5Ivpcmpuw5I0https://www.felixcloutier.com/x86/vpcmpw:vpcmpuw VPHADDUBD VPHADDUBD vphaddubd1Packed Horizontal Add Unsigned Byte to Doubleword vphaddubd" vphaddubd/"VPORVPORvporPacked Bitwise Logical ORvpor vpor/ vpor!vpor2! VPUNPCKLDQ VPUNPCKLDQ vpunpckldq:Unpack and Interleave Low-Order Doublewords into Quadwords vpunpckldq9H vpunpckldqH vpunpckldq:H vpunpckldqH vpunpckldq;H vpunpckldqH vpunpckldq9H vpunpckldq  vpunpckldqH vpunpckldq/  vpunpckldq:H vpunpckldq! vpunpckldqH vpunpckldq2! vpunpckldq;H vpunpckldqHVPMOVM2WVPMOVM2Wvpmovm2w4Expand Bits of Mask Register to Packed Word Integersvpmovm2wIvpmovm2wIvpmovm2wIEhttps://www.felixcloutier.com/x86/vpmovm2b:vpmovm2w:vpmovm2d:vpmovm2q VREDUCESS VREDUCESS vreducessRPerform Reduction Transformation on a Scalar Single-Precision Floating-Point Value vreducessJ vreducess'J vreducessJ vreducess'J+https://www.felixcloutier.com/x86/vreducessPINSRDPINSRDpinsrdInsert DoublewordpinsrdPINSRDpinsrdPINSRD'6https://www.felixcloutier.com/x86/pinsrb:pinsrd:pinsrqINTINTintCall to Interrupt ProcedureintINT VFNMSUB213SH VFNMSUB213SH vfnmsub213shOFused Negative Multiply-Subtract of Scalar Half-Precision Floating-Point Values vfnmsub213shR vfnmsub213sh$R vfnmsub213shR vfnmsub213sh$R vfnmsub213shQR vfnmsub213shQRlhttps://www.felixcloutier.com/x86/vfmsub132sh:vfnmsub132sh:vfmsub213sh:vfnmsub213sh:vfmsub231sh:vfnmsub231shMOVDQAMOVDQAmovdqaMove Aligned Double QuadwordmovdqaMOVOmovdqaMOVO/movdqaMOVO/<https://www.felixcloutier.com/x86/movdqa:vmovdqa32:vmovdqa64BZHIBZHIbzhi3Zero High Bits Starting with Specified Bit Positionbzhil5bzhil'5&https://www.felixcloutier.com/x86/bzhi PUNPCKLQDQ PUNPCKLQDQ punpcklqdq?Unpack and Interleave Low-Order Quadwords into Double Quadwords punpcklqdq PUNPCKLQDQ punpcklqdq PUNPCKLQDQ/Jhttps://www.felixcloutier.com/x86/punpcklbw:punpcklwd:punpckldq:punpcklqdqKORQKORQkorqBitwise Logical OR 64-bit MaskskorqI5https://www.felixcloutier.com/x86/korw:korb:korq:kord VCVTTSH2USI VCVTTSH2USI vcvttsh2usiVConvert with Truncation Scalar Half-Precision Floating-Point Value to Unsigned Integer vcvttsh2usiR vcvttsh2usi$R vcvttsh2usiRR-https://www.felixcloutier.com/x86/vcvttsh2usi VCVTTSS2USI VCVTTSS2USI vcvttss2usiXConvert with Truncation Scalar Single-Precision Floating-Point Value to Unsigned Integer vcvttss2usiH vcvttss2usi'H vcvttss2usiRH-https://www.felixcloutier.com/x86/vcvttss2usi VDBPSADBW VDBPSADBW vdbpsadbw>Double Block Packed Sum-Absolute-Differences on Unsigned Bytes  vdbpsadbwI vdbpsadbw/I vdbpsadbwI vdbpsadbw2I vdbpsadbwI vdbpsadbw5I vdbpsadbwI vdbpsadbw/I vdbpsadbwI vdbpsadbw2I vdbpsadbwI vdbpsadbw5I+https://www.felixcloutier.com/x86/vdbpsadbw VPTESTNMW VPTESTNMW vptestnmw7Logical NAND of Packed Word Integer Values and Set Mask  vptestnmwI vptestnmwI vptestnmw/I vptestnmw/I vptestnmwI vptestnmwI vptestnmw2I vptestnmw2I vptestnmwI vptestnmwI vptestnmw5I vptestnmw5IIhttps://www.felixcloutier.com/x86/vptestnmb:vptestnmw:vptestnmd:vptestnmq VREDUCESH VREDUCESH vreduceshPPerform Reduction Transformation on a Scalar Half-Precision Floating-Point Value vreduceshR vreducesh$R vreduceshR vreducesh$R vreduceshRR vreduceshRR+https://www.felixcloutier.com/x86/vreducesh VPMADDUBSW VPMADDUBSW vpmaddubsw9Multiply and Add Packed Signed and Unsigned Byte Integers vpmaddubswI vpmaddubsw/I vpmaddubswI vpmaddubsw2I vpmaddubswI vpmaddubsw5I vpmaddubsw  vpmaddubswI vpmaddubsw/  vpmaddubsw/I vpmaddubsw! vpmaddubswI vpmaddubsw2! vpmaddubsw2I vpmaddubswI vpmaddubsw5IVAESENCVAESENCvaesenc+Perform One Round of an AES Encryption Flow vaesenc vaesencKvaesenc/ vaesenc/KvaesencvaesencKvaesenc2vaesenc2KvaesencHvaesenc5HIDIVIDIVidiv Signed DivideidivbIDIVB idivwIDIVW idivlIDIVLidivbIDIVB#idivwIDIVW$idivlIDIVL'&https://www.felixcloutier.com/x86/idiv VFNMSUBSD VFNMSUBSD vfnmsubsdQFused Negative Multiply-Subtract of Scalar Double-Precision Floating-Point Values vfnmsubsd$ vfnmsubsd+$ vfnmsubsd+$VCMPPSVCMPPSvcmpps5Compare Packed Single-Precision Floating-Point Valuesvcmpps9Hvcmpps9HvcmppsHvcmppsHvcmpps:Hvcmpps:HvcmppsHvcmppsHvcmpps;Hvcmpps;HvcmppsHvcmppsHvcmpps vcmpps/ vcmpps vcmpps2 vcmppsRHvcmppsRHKUNPCKWDKUNPCKWDkunpckwd"Unpack and Interleave 16-bit MaskskunpckwdI<https://www.felixcloutier.com/x86/kunpckbw:kunpckwd:kunpckdqDPPDDPPDdppd<Dot Product of Packed Double Precision Floating-Point Valuesdppddppd/&https://www.felixcloutier.com/x86/dppd VCVTPH2QQ VCVTPH2QQ vcvtph2qq\Convert Packed Half Precision Floating-Point Values to Packed Singed Quadword Integer Values vcvtph2qq*K vcvtph2qq.K vcvtph2qq<R vcvtph2qqK vcvtph2qqK vcvtph2qqR vcvtph2qq*K vcvtph2qqK vcvtph2qq.K vcvtph2qqK vcvtph2qq<R vcvtph2qqR vcvtph2qqQR vcvtph2qqQR+https://www.felixcloutier.com/x86/vcvtph2qqVPSHUFBVPSHUFBvpshufbPacked Shuffle BytesvpshufbIvpshufb/IvpshufbIvpshufb2IvpshufbIvpshufb5Ivpshufb vpshufbIvpshufb/ vpshufb/Ivpshufb!vpshufbIvpshufb2!vpshufb2IvpshufbIvpshufb5IVPSRLQVPSRLQvpsrlq(Shift Packed Quadword Data Right Logicalvpsrlq=Hvpsrlq?HvpsrlqAHvpsrlqHvpsrlqHvpsrlq/HvpsrlqHvpsrlqHvpsrlq/HvpsrlqHvpsrlqHvpsrlq/Hvpsrlq=Hvpsrlq vpsrlqHvpsrlq vpsrlqHvpsrlq/ vpsrlq/Hvpsrlq?Hvpsrlq!vpsrlqHvpsrlq!vpsrlqHvpsrlq/!vpsrlq/HvpsrlqAHvpsrlqHvpsrlqHvpsrlq/HVSHUFPDVSHUFPDvshufpd5Shuffle Packed Double-Precision Floating-Point Valuesvshufpd=HvshufpdHvshufpd?HvshufpdHvshufpdAHvshufpdHvshufpd=Hvshufpd vshufpdHvshufpd/ vshufpd?Hvshufpd vshufpdHvshufpd2 vshufpdAHvshufpdHPFCMPGEPFCMPGEpfcmpge2Packed Floating-Point Compare for Greater or Equalpfcmpgepfcmpge+VFMADDSUB231PHVFMADDSUB231PHvfmaddsub231phVFused Multiply-Alternating Add/Subtract of Packed Half-Precision Floating-Point Valuesvfmaddsub231ph<Kvfmaddsub231phKvfmaddsub231ph>Kvfmaddsub231phKvfmaddsub231ph@Rvfmaddsub231phRvfmaddsub231ph<Kvfmaddsub231phKvfmaddsub231ph>Kvfmaddsub231phKvfmaddsub231ph@Rvfmaddsub231phRvfmaddsub231phQRvfmaddsub231phQRNhttps://www.felixcloutier.com/x86/vfmaddsub132ph:vfmaddsub213ph:vfmaddsub231ph VMOVSHDUP VMOVSHDUP vmovshdup(Move Packed Single-FP High and Duplicate vmovshdupH vmovshdupH vmovshdupH vmovshdup/H vmovshdup2H vmovshdup5H vmovshdup  vmovshdupH vmovshdup/  vmovshdup/H vmovshdup  vmovshdupH vmovshdup2  vmovshdup2H vmovshdupH vmovshdup5HCMOVSCMOVScmovsMove if sign (SF == 1)cmovsw  cmovsw $cmovslcmovsl'VPMACSWDVPMACSWDvpmacswd;Packed Multiply Accumulate Signed Word to Signed Doublewordvpmacswd"vpmacswd/"VUCOMISDVUCOMISDvucomisdNUnordered Compare Scalar Double-Precision Floating-Point Values and Set EFLAGSvucomisd vucomisdHvucomisd+ vucomisd+HvucomisdRHVMAXSHVMAXSHvmaxsh9Return Maximum Scalar Half-Precision Floating-Point ValuevmaxshRvmaxsh$RvmaxshRvmaxsh$RvmaxshRRvmaxshRR(https://www.felixcloutier.com/x86/vmaxshVFMSUBADD132PHVFMSUBADD132PHvfmsubadd132phVFused Multiply-Alternating Subtract/Add of Packed Half-Precision Floating-Point Valuesvfmsubadd132ph<Kvfmsubadd132phKvfmsubadd132ph>Kvfmsubadd132phKvfmsubadd132ph@Rvfmsubadd132phRvfmsubadd132ph<Kvfmsubadd132phKvfmsubadd132ph>Kvfmsubadd132phKvfmsubadd132ph@Rvfmsubadd132phRvfmsubadd132phQRvfmsubadd132phQRNhttps://www.felixcloutier.com/x86/vfmsubadd132ph:vfmsubadd213ph:vfmsubadd231phSETSSETSsetsSet byte if sign (SF == 1)setsSETMI setsSETMI#PMINUWPMINUWpminuw(Minimum of Packed Unsigned Word Integerspminuwpminuw//https://www.felixcloutier.com/x86/pminub:pminuw VFNMADD132PS VFNMADD132PS vfnmadd132psLFused Negative Multiply-Add of Packed Single-Precision Floating-Point Values vfnmadd132ps9H vfnmadd132psH vfnmadd132ps:H vfnmadd132psH vfnmadd132ps;H vfnmadd132psH vfnmadd132ps9H vfnmadd132ps# vfnmadd132psH vfnmadd132ps/# vfnmadd132ps:H vfnmadd132ps# vfnmadd132psH vfnmadd132ps2# vfnmadd132ps;H vfnmadd132psH vfnmadd132psQH vfnmadd132psQHHhttps://www.felixcloutier.com/x86/vfnmadd132ps:vfnmadd213ps:vfnmadd231ps VPSCATTERQD VPSCATTERQD vpscatterqd=Scatter Packed Doubleword Values with Signed Quadword Indices vpscatterqdEH vpscatterqdIH vpscatterqdMHQhttps://www.felixcloutier.com/x86/vpscatterdd:vpscatterdq:vpscatterqd:vpscatterqqVSCATTERPF0DPSVSCATTERPF0DPSvscatterpf0dps„Sparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Doubleword Indices Using T0 Hint with Intent to Writevscatterpf0dpsKL]https://www.felixcloutier.com/x86/vscatterpf0dps:vscatterpf0qps:vscatterpf0dpd:vscatterpf0qpdVPMOVM2QVPMOVM2Qvpmovm2q8Expand Bits of Mask Register to Packed Quadword Integersvpmovm2qJvpmovm2qJvpmovm2qJEhttps://www.felixcloutier.com/x86/vpmovm2b:vpmovm2w:vpmovm2d:vpmovm2qRCLRCLrclRotate Left through Carry FlagrclbRCLB rclbRCLB rclbRCLB rclwRCLW rclwRCLW rclwRCLW rcllRCLLrcllRCLLrcllRCLLrclbRCLB#rclbRCLB#rclbRCLB#rclwRCLW$rclwRCLW$rclwRCLW$rcllRCLL'rcllRCLL'rcllRCLL'1https://www.felixcloutier.com/x86/rcl:rcr:rol:rorVSQRTSDVSQRTSDvsqrtsdCCompute Square Root of Scalar Double-Precision Floating-Point ValuevsqrtsdHvsqrtsd+Hvsqrtsd vsqrtsdHvsqrtsd+ vsqrtsd+HvsqrtsdQHvsqrtsdQHVMOVAPSVMOVAPSvmovaps:Move Aligned Packed Single-Precision Floating-Point Valuesvmovaps0HvmovapsHvmovaps3HvmovapsHvmovaps6HvmovapsHvmovaps/Hvmovaps2Hvmovaps5Hvmovaps vmovapsHvmovaps/ vmovaps/Hvmovaps vmovapsHvmovaps2 vmovaps2HvmovapsHvmovaps5Hvmovaps/ vmovaps/Hvmovaps2 vmovaps2Hvmovaps5HCVTSI2SSCVTSI2SScvtsi2ss9Convert Dword Integer to Scalar Single-Precision FP Value cvtsi2sslCVTSL2SS cvtsi2sslCVTSL2SS'*https://www.felixcloutier.com/x86/cvtsi2ss VFMSUB213SS VFMSUB213SS vfmsub213ssHFused Multiply-Subtract of Scalar Single-Precision Floating-Point Values vfmsub213ssH vfmsub213ss'H vfmsub213ss# vfmsub213ssH vfmsub213ss'# vfmsub213ss'H vfmsub213ssQH vfmsub213ssQHEhttps://www.felixcloutier.com/x86/vfmsub132ss:vfmsub213ss:vfmsub231ssPTESTPTESTptestPacked Logical Compareptestptest/'https://www.felixcloutier.com/x86/ptest VGF2P8MULB VGF2P8MULB vgf2p8mulbGalois Field Multiply Bytes vgf2p8mulb vgf2p8mulb/ vgf2p8mulb vgf2p8mulb2 vgf2p8mulb vgf2p8mulb5 vgf2p8mulb vgf2p8mulb vgf2p8mulb/ vgf2p8mulb/ vgf2p8mulb vgf2p8mulb vgf2p8mulb2 vgf2p8mulb2 vgf2p8mulb vgf2p8mulb5 CVTTPS2DQ CVTTPS2DQ cvttps2dqRConvert with Truncation Packed Single-Precision FP Values to Packed Dword Integers cvttps2dq cvttps2dq/+https://www.felixcloutier.com/x86/cvttps2dqVPCMPBVPCMPBvpcmpb!Compare Packed Signed Byte Values vpcmpbIvpcmpbIvpcmpb/Ivpcmpb/IvpcmpbIvpcmpbIvpcmpb2Ivpcmpb2IvpcmpbIvpcmpbIvpcmpb5Ivpcmpb5I0https://www.felixcloutier.com/x86/vpcmpb:vpcmpub VSCALEFPS VSCALEFPS vscalefps_Scale Packed Single-Precision Floating-Point Values With Single-Precision Floating-Point Values vscalefps9H vscalefpsH vscalefps:H vscalefpsH vscalefps;H vscalefpsH vscalefps9H vscalefpsH vscalefps:H vscalefpsH vscalefps;H vscalefpsH vscalefpsQH vscalefpsQH+https://www.felixcloutier.com/x86/vscalefpsPMAXSDPMAXSDpmaxsd,Maximum of Packed Signed Doubleword Integerspmaxsdpmaxsd/=https://www.felixcloutier.com/x86/pmaxsb:pmaxsw:pmaxsd:pmaxsq VPTESTNMB VPTESTNMB vptestnmb7Logical NAND of Packed Byte Integer Values and Set Mask  vptestnmbI vptestnmbI vptestnmb/I vptestnmb/I vptestnmbI vptestnmbI vptestnmb2I vptestnmb2I vptestnmbI vptestnmbI vptestnmb5I vptestnmb5IIhttps://www.felixcloutier.com/x86/vptestnmb:vptestnmw:vptestnmd:vptestnmqSETNAESETNAEsetnae(Set byte if not above or equal (CF == 1)setnaeSETCS setnaeSETCS#CVTPS2PDCVTPS2PDcvtps2pdNConvert Packed Single-Precision FP Values to Packed Double-Precision FP Valuescvtps2pdCVTPS2PDcvtps2pdCVTPS2PD+*https://www.felixcloutier.com/x86/cvtps2pdJNCJNCjncJump if not carry (CF == 0)jncJCCNjncJCCOVCMPSSVCMPSSvcmpss5Compare Scalar Single-Precision Floating-Point ValuesvcmpssHvcmpssHvcmpss'Hvcmpss'Hvcmpss vcmpss' vcmpssRHvcmpssRHVPSHLDQVPSHLDQvpshldq7Concatenate and Shift Packed Quadword Data Left Logical vpshldq=KvpshldqKvpshldq?KvpshldqKvpshldqAUvpshldqUvpshldq=KvpshldqKvpshldq?KvpshldqKvpshldqAUvpshldqUKADDQKADDQkaddqADD Two 64-bit MaskskaddqI9https://www.felixcloutier.com/x86/kaddw:kaddb:kaddq:kaddd VRSQRT28PD VRSQRT28PD vrsqrt28pd€Approximation to the Reciprocal Square Root of Packed Double-Precision Floating-Point Values with Less Than 2^-28 Relative Error vrsqrt28pdAM vrsqrt28pdM vrsqrt28pdAM vrsqrt28pdM vrsqrt28pdRM vrsqrt28pdRM,https://www.felixcloutier.com/x86/vrsqrt28pd VFNMSUB231PS VFNMSUB231PS vfnmsub231psQFused Negative Multiply-Subtract of Packed Single-Precision Floating-Point Values vfnmsub231ps9H vfnmsub231psH vfnmsub231ps:H vfnmsub231psH vfnmsub231ps;H vfnmsub231psH vfnmsub231ps9H vfnmsub231ps# vfnmsub231psH vfnmsub231ps/# vfnmsub231ps:H vfnmsub231ps# vfnmsub231psH vfnmsub231ps2# vfnmsub231ps;H vfnmsub231psH vfnmsub231psQH vfnmsub231psQHHhttps://www.felixcloutier.com/x86/vfnmsub132ps:vfnmsub213ps:vfnmsub231psVPCMPDVPCMPDvpcmpd'Compare Packed Signed Doubleword Values vpcmpd9Hvpcmpd9HvpcmpdHvpcmpdHvpcmpd:Hvpcmpd:HvpcmpdHvpcmpdHvpcmpd;Hvpcmpd;HvpcmpdHvpcmpdH0https://www.felixcloutier.com/x86/vpcmpd:vpcmpudCMOVPOCMOVPOcmovpoMove if parity odd (PF == 0)cmovpow  cmovpow $cmovpolcmovpol'VPORQVPORQvporq.Bitwise Logical OR of Packed Quadword Integers vporq=HvporqHvporq?HvporqHvporqAHvporqHvporq=HvporqHvporq?HvporqHvporqAHvporqHSALSALsalArithmetic Shift LeftsalbSALB salbSALB salbSALB salwSALW salwSALW salwSALW sallSALLsallSALLsallSALLsalbSALB#salbSALB#salbSALB#salwSALW$salwSALW$salwSALW$sallSALL'sallSALL'sallSALL'1https://www.felixcloutier.com/x86/sal:sar:shl:shrT1MSKCT1MSKCt1mskcInverse Mask From Trailing Onest1mskc6t1mskc'6 VFMADD132SS VFMADD132SS vfmadd132ssCFused Multiply-Add of Scalar Single-Precision Floating-Point Values vfmadd132ssH vfmadd132ss'H vfmadd132ss# vfmadd132ssH vfmadd132ss'# vfmadd132ss'H vfmadd132ssQH vfmadd132ssQHEhttps://www.felixcloutier.com/x86/vfmadd132ss:vfmadd213ss:vfmadd231ssVPMULUDQVPMULUDQvpmuludq,Multiply Packed Unsigned Doubleword Integersvpmuludq=HvpmuludqHvpmuludq?HvpmuludqHvpmuludqAHvpmuludqHvpmuludq=Hvpmuludq vpmuludqHvpmuludq/ vpmuludq?Hvpmuludq!vpmuludqHvpmuludq2!vpmuludqAHvpmuludqHKSHIFTLDKSHIFTLDkshiftldShift Left 32-bit MaskskshiftldIEhttps://www.felixcloutier.com/x86/kshiftlw:kshiftlb:kshiftlq:kshiftldMOVZXMOVZXmovzxMove with Zero-ExtendmovzbwMOVBWZX  movzbwMOVBWZX #movzblMOVBLZX movzwlMOVWLZX movzblMOVBLZX#movzwlMOVWLZX$'https://www.felixcloutier.com/x86/movzx VCVTTSH2SI VCVTTSH2SI vcvttsh2siGConvert with Truncation Scalar Half-Precision FP Value to Dword Integer vcvttsh2siR vcvttsh2si$R vcvttsh2siRR,https://www.felixcloutier.com/x86/vcvttsh2si VFNMADDPS VFNMADDPS vfnmaddpsLFused Negative Multiply-Add of Packed Single-Precision Floating-Point Values vfnmaddps$ vfnmaddps/$ vfnmaddps/$ vfnmaddps$ vfnmaddps2$ vfnmaddps2$VPMOVWBVPMOVWBvpmovwb>Down Convert Packed Word Values to Byte Values with Truncation vpmovwbIvpmovwb,IvpmovwbIvpmovwb0IvpmovwbIvpmovwb3IvpmovwbIvpmovwbIvpmovwbIvpmovwb+Ivpmovwb/Ivpmovwb2I<https://www.felixcloutier.com/x86/vpmovwb:vpmovswb:vpmovuswbVFMSUBPSVFMSUBPSvfmsubpsHFused Multiply-Subtract of Packed Single-Precision Floating-Point Valuesvfmsubps$vfmsubps/$vfmsubps/$vfmsubps$vfmsubps2$vfmsubps2$POPPOPpopPop a Value from the StackpopwPOPW poplPOPLpopwPOPW$poplPOPL'%https://www.felixcloutier.com/x86/popSETNSSETNSsetnsSet byte if not sign (SF == 0)setnsSETPL setnsSETPL# VUNPCKLPD VUNPCKLPD vunpcklpdGUnpack and Interleave Low Packed Double-Precision Floating-Point Values vunpcklpd=H vunpcklpdH vunpcklpd?H vunpcklpdH vunpcklpdAH vunpcklpdH vunpcklpd=H vunpcklpd  vunpcklpdH vunpcklpd/  vunpcklpd?H vunpcklpd  vunpcklpdH vunpcklpd2  vunpcklpdAH vunpcklpdHPMINSDPMINSDpminsd,Minimum of Packed Signed Doubleword Integerspminsdpminsd//https://www.felixcloutier.com/x86/pminsd:pminsq VPERMI2PD VPERMI2PD vpermi2pd\Full Permute of Double-Precision Floating-Point Values From Two Tables Overwriting the Index  vpermi2pd=H vpermi2pdH vpermi2pd?H vpermi2pdH vpermi2pdAH vpermi2pdH vpermi2pd=H vpermi2pdH vpermi2pd?H vpermi2pdH vpermi2pdAH vpermi2pdHPhttps://www.felixcloutier.com/x86/vpermi2w:vpermi2d:vpermi2q:vpermi2ps:vpermi2pdSQRTPSSQRTPSsqrtpsECompute Square Roots of Packed Single-Precision Floating-Point ValuessqrtpsSQRTPSsqrtpsSQRTPS/(https://www.felixcloutier.com/x86/sqrtpsCMPPDCMPPDcmppd5Compare Packed Double-Precision Floating-Point ValuescmppdCMPPDcmppdCMPPD/'https://www.felixcloutier.com/x86/cmppdPMOVSXWQPMOVSXWQpmovsxwqBMove Packed Word Integers to Quadword Integers with Sign Extensionpmovsxwqpmovsxwq' VCVTTSD2SI VCVTTSD2SI vcvttsd2siJConvert with Truncation Scalar Double-Precision FP Value to Signed Integer vcvttsd2si  vcvttsd2siH vcvttsd2si+  vcvttsd2si+H vcvttsd2siRHVPERMQVPERMQvpermqPermute Quadword Integersvpermq?HvpermqAHvpermq?HvpermqHvpermqHvpermqAHvpermqHvpermqHvpermq?Hvpermq?Hvpermq!vpermqHvpermqHvpermq2!vpermqAHvpermqAHvpermqHvpermqH(https://www.felixcloutier.com/x86/vpermq VCVTTPS2QQ VCVTTPS2QQ vcvttps2qqnConvert with Truncation Packed Single Precision Floating-Point Values to Packed Singed Quadword Integer Values vcvttps2qq8J vcvttps2qq9J vcvttps2qq:J vcvttps2qqJ vcvttps2qqJ vcvttps2qqJ vcvttps2qq8J vcvttps2qqJ vcvttps2qq9J vcvttps2qqJ vcvttps2qq:J vcvttps2qqJ vcvttps2qqRJ vcvttps2qqRJ,https://www.felixcloutier.com/x86/vcvttps2qq VPMOVZXBW VPMOVZXBW vpmovzxbw>Move Packed Byte Integers to Word Integers with Zero Extension vpmovzxbwI vpmovzxbwI vpmovzxbwI vpmovzxbw+I vpmovzxbw/I vpmovzxbw2I vpmovzxbw  vpmovzxbwI vpmovzxbw+  vpmovzxbw+I vpmovzxbw! vpmovzxbwI vpmovzxbw/! vpmovzxbw/I vpmovzxbwI vpmovzxbw2IPSIGNBPSIGNBpsignbPacked Sign of Byte Integerspsignbpsignb+psignbpsignb/6https://www.felixcloutier.com/x86/psignb:psignw:psigndRCPPSRCPPSrcppsPCompute Approximate Reciprocals of Packed Single-Precision Floating-Point ValuesrcppsRCPPSrcppsRCPPS/'https://www.felixcloutier.com/x86/rcppsVFRCZSSVFRCZSSvfrczss7Extract Fraction Scalar Single-Precision Floating Pointvfrczss"vfrczss'"HADDPDHADDPDhaddpdPacked Double-FP Horizontal Addhaddpdhaddpd/(https://www.felixcloutier.com/x86/haddpdSETAESETAEsetae$Set byte if above or equal (CF == 0)setaeSETCC setaeSETCC#VRCP14SDVRCP14SDvrcp14sdPCompute Approximate Reciprocal of a Scalar Double-Precision Floating-Point Valuevrcp14sdHvrcp14sd+Hvrcp14sdHvrcp14sd+H*https://www.felixcloutier.com/x86/vrcp14sd VCVTDQ2PS VCVTDQ2PS vcvtdq2psBConvert Packed Dword Integers to Packed Single-Precision FP Values vcvtdq2ps9H vcvtdq2ps:H vcvtdq2ps;H vcvtdq2psH vcvtdq2psH vcvtdq2psH vcvtdq2ps9H vcvtdq2ps  vcvtdq2psH vcvtdq2ps/  vcvtdq2ps:H vcvtdq2ps  vcvtdq2psH vcvtdq2ps2  vcvtdq2ps;H vcvtdq2psH vcvtdq2psQH vcvtdq2psQH VPACKSSDW VPACKSSDW vpackssdw2Pack Doublewords into Words with Signed Saturation vpackssdw9I vpackssdwI vpackssdw:I vpackssdwI vpackssdw;I vpackssdwI vpackssdw9I vpackssdw  vpackssdwI vpackssdw/  vpackssdw:I vpackssdw! vpackssdwI vpackssdw2! vpackssdw;I vpackssdwIPAVGUSBPAVGUSBpavgusbAverage Packed Byte Integerspavgusbpavgusb+JNZJNZjnzJump if not zero (ZF == 0)jnzJNENjnzJNEOVFMADDSDVFMADDSDvfmaddsdCFused Multiply-Add of Scalar Double-Precision Floating-Point Valuesvfmaddsd$vfmaddsd+$vfmaddsd+$VPROLQVPROLQvprolqRotate Packed Quadword Left vprolq=Hvprolq?HvprolqAHvprolqHvprolqHvprolqHvprolq=HvprolqHvprolq?HvprolqHvprolqAHvprolqH?https://www.felixcloutier.com/x86/vprold:vprolvd:vprolq:vprolvqBLENDVPDBLENDVPDblendvpd= Variable Blend Packed Double Precision Floating-Point Valuesblendvpdblendvpd/*https://www.felixcloutier.com/x86/blendvpdJNAJNAjna&Jump if not above (CF == 1 or ZF == 1)jnaJLSNjnaJLSO SHA1RNDS4 SHA1RNDS4 sha1rnds4%Perform Four Rounds of SHA1 Operation sha1rnds4( sha1rnds4/(+https://www.felixcloutier.com/x86/sha1rnds4VRANGEPDVRANGEPDvrangepdXRange Restriction Calculation For Packed Pairs of Double-Precision Floating-Point Valuesvrangepd=JvrangepdJvrangepd?JvrangepdJvrangepdAJvrangepdJvrangepd=JvrangepdJvrangepd?JvrangepdJvrangepdAJvrangepdJvrangepdRJvrangepdRJ*https://www.felixcloutier.com/x86/vrangepdAADAADaadASCII Adjust AX Before DivisionaadAADaadAAD%https://www.felixcloutier.com/x86/aadRORRORror Rotate RightrorbRORB rorbRORB rorbRORB rorwRORW rorwRORW rorwRORW rorlRORLrorlRORLrorlRORLrorbRORB#rorbRORB#rorbRORB#rorwRORW$rorwRORW$rorwRORW$rorlRORL'rorlRORL'rorlRORL'1https://www.felixcloutier.com/x86/rcl:rcr:rol:rorUNPCKHPDUNPCKHPDunpckhpdHUnpack and Interleave High Packed Double-Precision Floating-Point ValuesunpckhpdUNPCKHPDunpckhpdUNPCKHPD/*https://www.felixcloutier.com/x86/unpckhpd VPUNPCKHWD VPUNPCKHWD vpunpckhwd7Unpack and Interleave High-Order Words into Doublewords vpunpckhwdI vpunpckhwd/I vpunpckhwdI vpunpckhwd2I vpunpckhwdI vpunpckhwd5I vpunpckhwd  vpunpckhwdI vpunpckhwd/  vpunpckhwd/I vpunpckhwd! vpunpckhwdI vpunpckhwd2! vpunpckhwd2I vpunpckhwdI vpunpckhwd5IVZEROALLVZEROALLvzeroallZero All YMM Registersvzeroall *https://www.felixcloutier.com/x86/vzeroallVPERMI2BVPERMI2Bvpermi2b;Full Permute of Bytes From Two Tables Overwriting the Index vpermi2bTvpermi2b/Tvpermi2bTvpermi2b2Tvpermi2bTvpermi2b5Tvpermi2bTvpermi2b/Tvpermi2bTvpermi2b2Tvpermi2bTvpermi2b5T*https://www.felixcloutier.com/x86/vpermi2bVPSHUFHWVPSHUFHWvpshufhwShuffle Packed High WordsvpshufhwIvpshufhwIvpshufhwIvpshufhw/Ivpshufhw2Ivpshufhw5Ivpshufhw vpshufhwIvpshufhw/ vpshufhw/Ivpshufhw!vpshufhwIvpshufhw2!vpshufhw2IvpshufhwIvpshufhw5IUMWAITUMWAITumwaitUser mode Monitor WaitumwaitG(https://www.felixcloutier.com/x86/umwaitVPOPCNTWVPOPCNTWvpopcntw)Packed Population Count for Word Integers vpopcntwKvpopcntwKvpopcntwSvpopcntw/Kvpopcntw2Kvpopcntw5SvpopcntwKvpopcntw/KvpopcntwKvpopcntw2KvpopcntwSvpopcntw5SMOVDQ2QMOVDQ2Qmovdq2q1Move Quadword from XMM to MMX Technology Registermovdq2q)https://www.felixcloutier.com/x86/movdq2q VFMSUBADDPS VFMSUBADDPS vfmsubaddpsXFused Multiply-Alternating Subtract/Add of Packed Single-Precision Floating-Point Values vfmsubaddps$ vfmsubaddps/$ vfmsubaddps/$ vfmsubaddps$ vfmsubaddps2$ vfmsubaddps2$ROUNDSDROUNDSDroundsd3Round Scalar Double Precision Floating-Point Valuesroundsdroundsd+)https://www.felixcloutier.com/x86/roundsdMOVNTDQMOVNTDQmovntdq-Store Double Quadword Using Non-Temporal HintmovntdqMOVNTO/)https://www.felixcloutier.com/x86/movntdq VEXTRACTF32X8 VEXTRACTF32X8 vextractf32x8AExtract 256 Bits of Packed Single-Precision Floating-Point Values vextractf32x8J vextractf32x83J vextractf32x8J vextractf32x82J VPCONFLICTD VPCONFLICTD vpconflictdWDetect Conflicts Within a Vector of Packed Doubleword Values into Dense Memory/Register  vpconflictd9N vpconflictd:N vpconflictd;N vpconflictdN vpconflictdN vpconflictdN vpconflictd9N vpconflictdN vpconflictd:N vpconflictdN vpconflictd;N vpconflictdN9https://www.felixcloutier.com/x86/vpconflictd:vpconflictqRORXRORXrorx,Rotate Right Logical Without Affecting Flagsrorxl5rorxl'5&https://www.felixcloutier.com/x86/rorxVPMAXSBVPMAXSBvpmaxsb&Maximum of Packed Signed Byte IntegersvpmaxsbIvpmaxsb/IvpmaxsbIvpmaxsb2IvpmaxsbIvpmaxsb5Ivpmaxsb vpmaxsbIvpmaxsb/ vpmaxsb/Ivpmaxsb!vpmaxsbIvpmaxsb2!vpmaxsb2IvpmaxsbIvpmaxsb5I VMOVDQA32 VMOVDQA32 vmovdqa32Move Aligned Doubleword Values vmovdqa320H vmovdqa32H vmovdqa323H vmovdqa32H vmovdqa326H vmovdqa32H vmovdqa32/H vmovdqa322H vmovdqa325H vmovdqa32H vmovdqa32/H vmovdqa32H vmovdqa322H vmovdqa32H vmovdqa325H vmovdqa32/H vmovdqa322H vmovdqa325H<https://www.felixcloutier.com/x86/movdqa:vmovdqa32:vmovdqa64VPMINSBVPMINSBvpminsb&Minimum of Packed Signed Byte IntegersvpminsbIvpminsb/IvpminsbIvpminsb2IvpminsbIvpminsb5Ivpminsb vpminsbIvpminsb/ vpminsb/Ivpminsb!vpminsbIvpminsb2!vpminsb2IvpminsbIvpminsb5IKXNORDKXNORDkxnord!Bitwise Logical XNOR 32-bit MaskskxnordI=https://www.felixcloutier.com/x86/kxnorw:kxnorb:kxnorq:kxnordPINSRWPINSRWpinsrw Insert Wordpinsrw pinsrw$ pinsrwpinsrw$(https://www.felixcloutier.com/x86/pinsrwPSHUFDPSHUFDpshufdShuffle Packed DoublewordspshufdPSHUFLpshufdPSHUFL/(https://www.felixcloutier.com/x86/pshufd VPCONFLICTQ VPCONFLICTQ vpconflictqUDetect Conflicts Within a Vector of Packed Quadword Values into Dense Memory/Register  vpconflictq=N vpconflictq?N vpconflictqAN vpconflictqN vpconflictqN vpconflictqN vpconflictq=N vpconflictqN vpconflictq?N vpconflictqN vpconflictqAN vpconflictqN9https://www.felixcloutier.com/x86/vpconflictd:vpconflictqVPSHUFLWVPSHUFLWvpshuflwShuffle Packed Low WordsvpshuflwIvpshuflwIvpshuflwIvpshuflw/Ivpshuflw2Ivpshuflw5Ivpshuflw vpshuflwIvpshuflw/ vpshuflw/Ivpshuflw!vpshuflwIvpshuflw2!vpshuflw2IvpshuflwIvpshuflw5I VRSQRT14SD VRSQRT14SD vrsqrt14sdaCompute Approximate Reciprocal of a Square Root of a Scalar Double-Precision Floating-Point Value vrsqrt14sdH vrsqrt14sd+H vrsqrt14sdH vrsqrt14sd+H,https://www.felixcloutier.com/x86/vrsqrt14sd VZEROUPPER VZEROUPPER vzeroupper Zero Upper Bits of YMM Registers vzeroupper ,https://www.felixcloutier.com/x86/vzeroupperJAJAja#Jump if above (CF == 0 and ZF == 0)jaJHINjaJHIORDTSCPRDTSCPrdtscp(Read Time-Stamp Counter and Processor IDrdtscp(https://www.felixcloutier.com/x86/rdtscpVFMADDSUB213PDVFMADDSUB213PDvfmaddsub213pdXFused Multiply-Alternating Add/Subtract of Packed Double-Precision Floating-Point Valuesvfmaddsub213pd=Hvfmaddsub213pdHvfmaddsub213pd?Hvfmaddsub213pdHvfmaddsub213pdAHvfmaddsub213pdHvfmaddsub213pd=Hvfmaddsub213pd#vfmaddsub213pdHvfmaddsub213pd/#vfmaddsub213pd?Hvfmaddsub213pd#vfmaddsub213pdHvfmaddsub213pd2#vfmaddsub213pdAHvfmaddsub213pdHvfmaddsub213pdQHvfmaddsub213pdQHNhttps://www.felixcloutier.com/x86/vfmaddsub132pd:vfmaddsub213pd:vfmaddsub231pdVMAXPHVMAXPHvmaxph:Return Maximum Packed Half-Precision Floating-Point Valuesvmaxph<KvmaxphKvmaxph>KvmaxphKvmaxph@RvmaxphRvmaxph<KvmaxphKvmaxph>KvmaxphKvmaxph@RvmaxphRvmaxphRRvmaxphRR(https://www.felixcloutier.com/x86/vmaxph VCVTPS2PH VCVTPS2PH vcvtps2ph<Convert Single-Precision FP value to Half-Precision FP value vcvtps2phH vcvtps2ph,H vcvtps2phH vcvtps2ph0H vcvtps2phH vcvtps2ph3H vcvtps2ph% vcvtps2phH vcvtps2ph% vcvtps2phH vcvtps2phH vcvtps2ph+% vcvtps2ph+H vcvtps2ph/% vcvtps2ph/H vcvtps2ph2H vcvtps2phRH vcvtps2phRH+https://www.felixcloutier.com/x86/vcvtps2phVSUBSHVSUBSHvsubsh4Subtract Scalar Half-Precision Floating-Point ValuesvsubshRvsubsh$RvsubshRvsubsh$RvsubshQRvsubshQR(https://www.felixcloutier.com/x86/vsubsh VRNDSCALESH VRNDSCALESH vrndscalesh[Round Scalar Half-Precision Floating-Point Value To Include A Given Number Of Fraction Bits vrndscaleshR vrndscalesh$R vrndscaleshR vrndscalesh$R vrndscaleshRR vrndscaleshRR-https://www.felixcloutier.com/x86/vrndscalesh VPMACSDQH VPMACSDQH vpmacsdqhDPacked Multiply Accumulate Signed High Doubleword to Signed Quadword vpmacsdqh" vpmacsdqh/" GF2P8AFFINEQB GF2P8AFFINEQB gf2p8affineqb(Galois Field (2^8) Affine Transformation gf2p8affineqb gf2p8affineqb//https://www.felixcloutier.com/x86/gf2p8affineqbKORBKORBkorbBitwise Logical OR 8-bit MaskskorbJ5https://www.felixcloutier.com/x86/korw:korb:korq:kord VPMOVZXBQ VPMOVZXBQ vpmovzxbqBMove Packed Byte Integers to Quadword Integers with Zero Extension vpmovzxbqH vpmovzxbqH vpmovzxbqH vpmovzxbq$H vpmovzxbq'H vpmovzxbq+H vpmovzxbq  vpmovzxbqH vpmovzxbq$  vpmovzxbq$H vpmovzxbq! vpmovzxbqH vpmovzxbq'! vpmovzxbq'H vpmovzxbqH vpmovzxbq+HVDPPDVDPPDvdppd<Dot Product of Packed Double Precision Floating-Point Valuesvdppd vdppd/ COMISSCOMISScomissLCompare Scalar Ordered Single-Precision Floating-Point Values and Set EFLAGScomissCOMISScomissCOMISS'(https://www.felixcloutier.com/x86/comiss VBLENDVPS VBLENDVPS vblendvps= Variable Blend Packed Single Precision Floating-Point Values vblendvps  vblendvps/  vblendvps  vblendvps2  VGETMANTSH VGETMANTSH vgetmantshKExtract Normalized Mantissa from Scalar Half-Precision Floating-Point Value vgetmantshR vgetmantsh$R vgetmantshR vgetmantsh$R vgetmantshRR vgetmantshRR,https://www.felixcloutier.com/x86/vgetmantsh PUNPCKHWD PUNPCKHWD punpckhwd7Unpack and Interleave High-Order Words into Doublewords punpckhwd  punpckhwd+  punpckhwd punpckhwd/Jhttps://www.felixcloutier.com/x86/punpckhbw:punpckhwd:punpckhdq:punpckhqdqVMINPDVMINPDvminpd<Return Minimum Packed Double-Precision Floating-Point Valuesvminpd=HvminpdHvminpd?HvminpdHvminpdAHvminpdHvminpd=Hvminpd vminpdHvminpd/ vminpd?Hvminpd vminpdHvminpd2 vminpdAHvminpdHvminpdRHvminpdRHJNBJNBjnbJump if not below (CF == 0)jnbJCCNjnbJCCOPACKUSDWPACKUSDWpackusdw4Pack Doublewords into Words with Unsigned Saturationpackusdwpackusdw/*https://www.felixcloutier.com/x86/packusdwLEALEAleaLoad Effective AddressleawLEAW "lealLEAL"%https://www.felixcloutier.com/x86/lea VPCOMPRESSQ VPCOMPRESSQ vpcompressqFStore Sparse Packed Quadword Integer Values into Dense Memory/Register  vpcompressqH vpcompressq0H vpcompressqH vpcompressq3H vpcompressqH vpcompressq6H vpcompressqH vpcompressqH vpcompressqH vpcompressq/H vpcompressq2H vpcompressq5H-https://www.felixcloutier.com/x86/vpcompressq VFMADDSUBPD VFMADDSUBPD vfmaddsubpdXFused Multiply-Alternating Add/Subtract of Packed Double-Precision Floating-Point Values vfmaddsubpd$ vfmaddsubpd/$ vfmaddsubpd/$ vfmaddsubpd$ vfmaddsubpd2$ vfmaddsubpd2$ VCVTUQQ2PD VCVTUQQ2PD vcvtuqq2pdZConvert Packed Unsigned Quadword Integers to Packed Double-Precision Floating-Point Values vcvtuqq2pd=J vcvtuqq2pd?J vcvtuqq2pdAJ vcvtuqq2pdJ vcvtuqq2pdJ vcvtuqq2pdJ vcvtuqq2pd=J vcvtuqq2pdJ vcvtuqq2pd?J vcvtuqq2pdJ vcvtuqq2pdAJ vcvtuqq2pdJ vcvtuqq2pdQJ vcvtuqq2pdQJ,https://www.felixcloutier.com/x86/vcvtuqq2pdTZCNTTZCNTtzcnt&Count the Number of Trailing Zero Bitstzcntw  4tzcntw $4tzcntl4tzcntl'4'https://www.felixcloutier.com/x86/tzcntVBLENDPDVBLENDPDvblendpd3Blend Packed Double Precision Floating-Point Valuesvblendpd vblendpd/ vblendpd vblendpd2 KXNORQKXNORQkxnorq!Bitwise Logical XNOR 64-bit MaskskxnorqI=https://www.felixcloutier.com/x86/kxnorw:kxnorb:kxnorq:kxnord VFMADDSUBPS VFMADDSUBPS vfmaddsubpsXFused Multiply-Alternating Add/Subtract of Packed Single-Precision Floating-Point Values vfmaddsubps$ vfmaddsubps/$ vfmaddsubps/$ vfmaddsubps$ vfmaddsubps2$ vfmaddsubps2$ VEXPANDPD VEXPANDPD vexpandpdKLoad Sparse Packed Double-Precision Floating-Point Values from Dense Memory  vexpandpdK vexpandpdH vexpandpdH vexpandpd/K vexpandpd2H vexpandpd5H vexpandpdK vexpandpd/K vexpandpdH vexpandpd2H vexpandpdH vexpandpd5H+https://www.felixcloutier.com/x86/vexpandpdVSQRTPHVSQRTPHvsqrtphCCompute Square Roots of Packed Half-Precision Floating-Point Valuesvsqrtph<Kvsqrtph>Kvsqrtph@RvsqrtphKvsqrtphKvsqrtphRvsqrtph<KvsqrtphKvsqrtph>KvsqrtphKvsqrtph@RvsqrtphRvsqrtphQRvsqrtphQR)https://www.felixcloutier.com/x86/vsqrtphAESKEYGENASSISTAESKEYGENASSISTaeskeygenassistAES Round Key Generation Assistaeskeygenassist'aeskeygenassist/'1https://www.felixcloutier.com/x86/aeskeygenassist VPMOVZXWD VPMOVZXWD vpmovzxwdDMove Packed Word Integers to Doubleword Integers with Zero Extension vpmovzxwdH vpmovzxwdH vpmovzxwdH vpmovzxwd+H vpmovzxwd/H vpmovzxwd2H vpmovzxwd  vpmovzxwdH vpmovzxwd+  vpmovzxwd+H vpmovzxwd! vpmovzxwdH vpmovzxwd/! vpmovzxwd/H vpmovzxwdH vpmovzxwd2HMOVLHPSMOVLHPSmovlhps>Move Packed Single-Precision Floating-Point Values Low to HighmovlhpsMOVLHPS)https://www.felixcloutier.com/x86/movlhpsVPERMT2QVPERMT2Qvpermt2q=Full Permute of Quadwords From Two Tables Overwriting a Table vpermt2q=Hvpermt2qHvpermt2q?Hvpermt2qHvpermt2qAHvpermt2qHvpermt2q=Hvpermt2qHvpermt2q?Hvpermt2qHvpermt2qAHvpermt2qHPhttps://www.felixcloutier.com/x86/vpermt2w:vpermt2d:vpermt2q:vpermt2ps:vpermt2pdBSRBSRbsrBit Scan ReversebsrwBSRW  bsrwBSRW $bsrlBSRLbsrlBSRL'%https://www.felixcloutier.com/x86/bsrVMOVSSVMOVSSvmovss2Move Scalar Single-Precision Floating-Point Values vmovss(Hvmovss'Hvmovss' vmovss'Hvmovss' vmovss'HvmovssHvmovss vmovssHVPSRLDVPSRLDvpsrld*Shift Packed Doubleword Data Right Logicalvpsrld9Hvpsrld:Hvpsrld;HvpsrldHvpsrldHvpsrld/HvpsrldHvpsrldHvpsrld/HvpsrldHvpsrldHvpsrld/Hvpsrld9Hvpsrld vpsrldHvpsrld vpsrldHvpsrld/ vpsrld/Hvpsrld:Hvpsrld!vpsrldHvpsrld!vpsrldHvpsrld/!vpsrld/Hvpsrld;HvpsrldHvpsrldHvpsrld/H SHA256RNDS2 SHA256RNDS2 sha256rnds2&Perform Two Rounds of SHA256 Operation sha256rnds2( sha256rnds2/(-https://www.felixcloutier.com/x86/sha256rnds2VBROADCASTF64X4VBROADCASTF64X4vbroadcastf64x47Broadcast Four Double-Precision Floating-Point Elementsvbroadcastf64x42Hvbroadcastf64x42H VEXTRACTF64X4 VEXTRACTF64X4 vextractf64x4AExtract 256 Bits of Packed Double-Precision Floating-Point Values vextractf64x4H vextractf64x43H vextractf64x4H vextractf64x42HVMULPHVMULPHvmulph4Multiply Packed Half-Precision Floating-Point Valuesvmulph<KvmulphKvmulph>KvmulphKvmulph@RvmulphRvmulph<KvmulphKvmulph>KvmulphKvmulph@RvmulphRvmulphQRvmulphQR(https://www.felixcloutier.com/x86/vmulphVMOVDQAVMOVDQAvmovdqaMove Aligned Double Quadwordvmovdqa vmovdqa/ vmovdqa vmovdqa2 vmovdqa/ vmovdqa2 VPABSDVPABSDvpabsd,Packed Absolute Value of Doubleword Integersvpabsd9Hvpabsd:Hvpabsd;HvpabsdHvpabsdHvpabsdHvpabsd9Hvpabsd vpabsdHvpabsd/ vpabsd:Hvpabsd!vpabsdHvpabsd2!vpabsd;HvpabsdHPCMPGTDPCMPGTDpcmpgtd:Compare Packed Signed Doubleword Integers for Greater Thanpcmpgtd pcmpgtd+ pcmpgtdpcmpgtd/9https://www.felixcloutier.com/x86/pcmpgtb:pcmpgtw:pcmpgtdKANDBKANDBkandbBitwise Logical AND 8-bit MaskskandbJ9https://www.felixcloutier.com/x86/kandw:kandb:kandq:kanddCMOVNGECMOVNGEcmovnge'Move if not greater or equal (SF != OF)cmovngew  cmovngew $cmovngelcmovngel' VCVTTPD2QQ VCVTTPD2QQ vcvttpd2qqaConvert with Truncation Packed Double-Precision Floating-Point Values to Packed Quadword Integers vcvttpd2qq=J vcvttpd2qq?J vcvttpd2qqAJ vcvttpd2qqJ vcvttpd2qqJ vcvttpd2qqJ vcvttpd2qq=J vcvttpd2qqJ vcvttpd2qq?J vcvttpd2qqJ vcvttpd2qqAJ vcvttpd2qqJ vcvttpd2qqRJ vcvttpd2qqRJ,https://www.felixcloutier.com/x86/vcvttpd2qqPMOVSXBQPMOVSXBQpmovsxbqBMove Packed Byte Integers to Quadword Integers with Sign Extensionpmovsxbqpmovsxbq$VPANDVPANDvpandPacked Bitwise Logical ANDvpand vpand/ vpand!vpand2! VPBROADCASTQ VPBROADCASTQ vpbroadcastqBroadcast Quadword Integer vpbroadcastqH vpbroadcastqH vpbroadcastqH vpbroadcastq+H vpbroadcastq+H vpbroadcastq+H vpbroadcastq! vpbroadcastqH vpbroadcastq+! vpbroadcastq+H vpbroadcastq! vpbroadcastqH vpbroadcastq+! vpbroadcastq+H vpbroadcastqH vpbroadcastq+HUhttps://www.felixcloutier.com/x86/vpbroadcastb:vpbroadcastw:vpbroadcastd:vpbroadcastqVPSUBDVPSUBDvpsubd#Subtract Packed Doubleword Integersvpsubd9HvpsubdHvpsubd:HvpsubdHvpsubd;HvpsubdHvpsubd9Hvpsubd vpsubdHvpsubd/ vpsubd:Hvpsubd!vpsubdHvpsubd2!vpsubd;HvpsubdHVMOVLPSVMOVLPSvmovlps6Move Low Packed Single-Precision Floating-Point Valuesvmovlps+ vmovlps+Hvmovlps+ vmovlps+H VPEXPANDQ VPEXPANDQ vpexpandqELoad Sparse Packed Quadword Integer Values from Dense Memory/Register  vpexpandqH vpexpandqH vpexpandqH vpexpandq/H vpexpandq2H vpexpandq5H vpexpandqH vpexpandq/H vpexpandqH vpexpandq2H vpexpandqH vpexpandq5H+https://www.felixcloutier.com/x86/vpexpandqVPSUBWVPSUBWvpsubwSubtract Packed Word IntegersvpsubwIvpsubw/IvpsubwIvpsubw2IvpsubwIvpsubw5Ivpsubw vpsubwIvpsubw/ vpsubw/Ivpsubw!vpsubwIvpsubw2!vpsubw2IvpsubwIvpsubw5I VEXTRACTPS VEXTRACTPS vextractps4Extract Packed Single Precision Floating-Point Value vextractps  vextractpsH vextractps'  vextractps'H VPEXPANDD VPEXPANDD vpexpanddGLoad Sparse Packed Doubleword Integer Values from Dense Memory/Register  vpexpanddH vpexpanddH vpexpanddH vpexpandd/H vpexpandd2H vpexpandd5H vpexpanddH vpexpandd/H vpexpanddH vpexpandd2H vpexpanddH vpexpandd5H+https://www.felixcloutier.com/x86/vpexpanddKXORDKXORDkxord Bitwise Logical XOR 32-bit MaskskxordI9https://www.felixcloutier.com/x86/kxorw:kxorb:kxorq:kxordVRCP14PDVRCP14PDvrcp14pdPCompute Approximate Reciprocals of Packed Double-Precision Floating-Point Values vrcp14pd=Hvrcp14pd?Hvrcp14pdAHvrcp14pdHvrcp14pdHvrcp14pdHvrcp14pd=Hvrcp14pdHvrcp14pd?Hvrcp14pdHvrcp14pdAHvrcp14pdH*https://www.felixcloutier.com/x86/vrcp14pdMOVBEMOVBEmovbeMove Data After Swapping Bytesmovbew $/movbel'/movbew$ /movbel'/'https://www.felixcloutier.com/x86/movbePMULHRSWPMULHRSWpmulhrswOPacked Multiply Signed Word Integers and Store High Result with Round and Scalepmulhrswpmulhrsw+pmulhrswpmulhrsw/*https://www.felixcloutier.com/x86/pmulhrswCVTSS2SICVTSS2SIcvtss2si9Convert Scalar Single-Precision FP Value to Dword Integercvtss2siCVTSS2SLcvtss2siCVTSS2SL'*https://www.felixcloutier.com/x86/cvtss2siRCRRCRrcrRotate Right through Carry FlagrcrbRCRB rcrbRCRB rcrbRCRB rcrwRCRW rcrwRCRW rcrwRCRW rcrlRCRLrcrlRCRLrcrlRCRLrcrbRCRB#rcrbRCRB#rcrbRCRB#rcrwRCRW$rcrwRCRW$rcrwRCRW$rcrlRCRL'rcrlRCRL'rcrlRCRL'1https://www.felixcloutier.com/x86/rcl:rcr:rol:rorVMOVUPDVMOVUPDvmovupd<Move Unaligned Packed Double-Precision Floating-Point Valuesvmovupd0HvmovupdHvmovupd3HvmovupdHvmovupd6HvmovupdHvmovupd/Hvmovupd2Hvmovupd5Hvmovupd vmovupdHvmovupd/ vmovupd/Hvmovupd vmovupdHvmovupd2 vmovupd2HvmovupdHvmovupd5Hvmovupd/ vmovupd/Hvmovupd2 vmovupd2Hvmovupd5HCMOVGECMOVGEcmovge#Move if greater or equal (SF == OF)cmovgew  cmovgew $cmovgelcmovgel'SFENCESFENCEsfence Store FencesfenceSFENCE (https://www.felixcloutier.com/x86/sfence VPMOVZXDQ VPMOVZXDQ vpmovzxdqHMove Packed Doubleword Integers to Quadword Integers with Zero Extension vpmovzxdqH vpmovzxdqH vpmovzxdqH vpmovzxdq+H vpmovzxdq/H vpmovzxdq2H vpmovzxdq  vpmovzxdqH vpmovzxdq+  vpmovzxdq+H vpmovzxdq! vpmovzxdqH vpmovzxdq/! vpmovzxdq/H vpmovzxdqH vpmovzxdq2HVPMOVSQWVPMOVSQWvpmovsqwIDown Convert Packed Quadword Values to Word Values with Signed Saturation vpmovsqwHvpmovsqw(HvpmovsqwHvpmovsqw,HvpmovsqwHvpmovsqw0HvpmovsqwHvpmovsqwHvpmovsqwHvpmovsqw'Hvpmovsqw+Hvpmovsqw/H<https://www.felixcloutier.com/x86/vpmovqw:vpmovsqw:vpmovusqwVPSHUFDVPSHUFDvpshufdShuffle Packed Doublewordsvpshufd9Hvpshufd:Hvpshufd;HvpshufdHvpshufdHvpshufdHvpshufd9Hvpshufd vpshufdHvpshufd/ vpshufd:Hvpshufd!vpshufdHvpshufd2!vpshufd;HvpshufdHMOVNTDQAMOVNTDQAmovntdqa.Load Double Quadword Non-Temporal Aligned Hintmovntdqa/*https://www.felixcloutier.com/x86/movntdqaVRSQRTSSVRSQRTSSvrsqrtssQCompute Reciprocal of Square Root of Scalar Single-Precision Floating-Point Valuevrsqrtss vrsqrtss' VRSQRTPHVRSQRTPHvrsqrtphRCompute Reciprocals of Square Roots of Packed Half-Precision Floating-Point Values vrsqrtph<Kvrsqrtph>Kvrsqrtph@RvrsqrtphKvrsqrtphKvrsqrtphRvrsqrtph<KvrsqrtphKvrsqrtph>KvrsqrtphKvrsqrtph@RvrsqrtphR*https://www.felixcloutier.com/x86/vrsqrtphMOVQMOVQmovq Move QuadwordmovqMOVQ movqMOVQ+ movqMOVQmovqMOVQ+movqMOVQ+ movqMOVQ+&https://www.felixcloutier.com/x86/movqPACKUSWBPACKUSWBpackuswb.Pack Words into Bytes with Unsigned Saturationpackuswb packuswb+ packuswbpackuswb/*https://www.felixcloutier.com/x86/packuswbVHADDPDVHADDPDvhaddpdPacked Double-FP Horizontal Addvhaddpd vhaddpd/ vhaddpd vhaddpd2 PSRLDQPSRLDQpsrldq*Shift Packed Double Quadword Right Logicalpsrldq(https://www.felixcloutier.com/x86/psrldq VPHADDUBQ VPHADDUBQ vphaddubq/Packed Horizontal Add Unsigned Byte to Quadword vphaddubq" vphaddubq/" VCVTSS2SH VCVTSS2SH vcvtss2shJConvert Scalar Single-Precision FP Value to Scalar Half-Precision FP Value vcvtss2shR vcvtss2sh'R vcvtss2shR vcvtss2sh'R vcvtss2shQR vcvtss2shQR+https://www.felixcloutier.com/x86/vcvtss2shVPSRAVWVPSRAVWvpsravw0Variable Shift Packed Word Data Right Arithmetic vpsravwIvpsravw/IvpsravwIvpsravw2IvpsravwIvpsravw5IvpsravwIvpsravw/IvpsravwIvpsravw2IvpsravwIvpsravw5I9https://www.felixcloutier.com/x86/vpsravw:vpsravd:vpsravqVPADDUSBVPADDUSBvpaddusb:Add Packed Unsigned Byte Integers with Unsigned SaturationvpaddusbIvpaddusb/IvpaddusbIvpaddusb2IvpaddusbIvpaddusb5Ivpaddusb vpaddusbIvpaddusb/ vpaddusb/Ivpaddusb!vpaddusbIvpaddusb2!vpaddusb2IvpaddusbIvpaddusb5IVPDPBSUDVPDPBSUDvpdpbsudHPacked Dot Product of Signed-by-Unsinged Byte subvectors into DoublewordvpdpbsudXvpdpbsud/XvpdpbsudXvpdpbsud2XVRSQRTPSVRSQRTPSvrsqrtpsTCompute Reciprocals of Square Roots of Packed Single-Precision Floating-Point Valuesvrsqrtps vrsqrtps/ vrsqrtps vrsqrtps2  VCVTTPS2UDQ VCVTTPS2UDQ vcvttps2udqrConvert with Truncation Packed Single-Precision Floating-Point Values to Packed Unsigned Doubleword Integer Values vcvttps2udq9H vcvttps2udq:H vcvttps2udq;H vcvttps2udqH vcvttps2udqH vcvttps2udqH vcvttps2udq9H vcvttps2udqH vcvttps2udq:H vcvttps2udqH vcvttps2udq;H vcvttps2udqH vcvttps2udqRH vcvttps2udqRH-https://www.felixcloutier.com/x86/vcvttps2udqJECXZJECXZjecxzJump if ECX register is 0jecxzJCXZLNVPSLLVDVPSLLVDvpsllvd2Variable Shift Packed Doubleword Data Left Logicalvpsllvd9HvpsllvdHvpsllvd:HvpsllvdHvpsllvd;HvpsllvdHvpsllvd9Hvpsllvd!vpsllvdHvpsllvd/!vpsllvd:Hvpsllvd!vpsllvdHvpsllvd2!vpsllvd;HvpsllvdH9https://www.felixcloutier.com/x86/vpsllvw:vpsllvd:vpsllvqKTESTQKTESTQktestq#Bit Test 64-bit Masks and Set FlagsktestqI=https://www.felixcloutier.com/x86/ktestw:ktestb:ktestq:ktestd VCVTPS2UQQ VCVTPS2UQQ vcvtps2uqq`Convert Packed Single Precision Floating-Point Values to Packed Unsigned Quadword Integer Values vcvtps2uqq8J vcvtps2uqq9J vcvtps2uqq:J vcvtps2uqqJ vcvtps2uqqJ vcvtps2uqqJ vcvtps2uqq8J vcvtps2uqqJ vcvtps2uqq9J vcvtps2uqqJ vcvtps2uqq:J vcvtps2uqqJ vcvtps2uqqQJ vcvtps2uqqQJ,https://www.felixcloutier.com/x86/vcvtps2uqq VFNMSUB213PH VFNMSUB213PH vfnmsub213phOFused Negative Multiply-Subtract of Packed Half-Precision Floating-Point Values vfnmsub213ph<K vfnmsub213phK vfnmsub213ph>K vfnmsub213phK vfnmsub213ph@R vfnmsub213phR vfnmsub213ph<K vfnmsub213phK vfnmsub213ph>K vfnmsub213phK vfnmsub213ph@R vfnmsub213phR vfnmsub213phQR vfnmsub213phQRlhttps://www.felixcloutier.com/x86/vfmsub132ph:vfnmsub132ph:vfmsub213ph:vfnmsub213ph:vfmsub231ph:vfnmsub231phVMOVDDUPVMOVDDUPvmovddup Move One Double-FP and DuplicatevmovddupHvmovddupHvmovddupHvmovddup+Hvmovddup2Hvmovddup5Hvmovddup vmovddupHvmovddup+ vmovddup+Hvmovddup vmovddupHvmovddup2 vmovddup2HvmovddupHvmovddup5H VFNMADD231PD VFNMADD231PD vfnmadd231pdLFused Negative Multiply-Add of Packed Double-Precision Floating-Point Values vfnmadd231pd=H vfnmadd231pdH vfnmadd231pd?H vfnmadd231pdH vfnmadd231pdAH vfnmadd231pdH vfnmadd231pd=H vfnmadd231pd# vfnmadd231pdH vfnmadd231pd/# vfnmadd231pd?H vfnmadd231pd# vfnmadd231pdH vfnmadd231pd2# vfnmadd231pdAH vfnmadd231pdH vfnmadd231pdQH vfnmadd231pdQHHhttps://www.felixcloutier.com/x86/vfnmadd132pd:vfnmadd213pd:vfnmadd231pd